Deduplication Impact


What This Shows: Article counts by source before deduplication, showing which search engines contributed most to the raw results.

Why It Matters: Overlapping search results contain duplicates that would skew the analysis. Of the 8,955 raw articles, 2,295 (25.6%) were removed as duplicates, leaving 6,660 unique articles.
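The removal rate can be checked from the two totals shown in the chart. A minimal sketch; the variable names are illustrative and not taken from the dashboard:

```python
# Totals reported in the chart legend.
raw_total = 8955      # articles before deduplication
unique_total = 6660   # articles after deduplication

# Duplicates are the difference; the rate is relative to the raw total.
duplicates_removed = raw_total - unique_total
removal_rate = duplicates_removed / raw_total

print(f"{duplicates_removed:,} duplicates, {removal_rate:.1%} removed")
# prints "2,295 duplicates, 25.6% removed"
```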

How to Interpret: Bar length shows the raw article count for each source. The legend shows totals before and after deduplication. When the same article appears in more than one source, the copy from an academic source is kept.
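One way the source-priority rule could work is sketched below. This is an illustration under stated assumptions, not the dashboard's actual method: matching duplicates on a normalized title, the `deduplicate` and `normalize_title` helpers, and ranking News above Other are all hypothetical. The source states only that academic sources are prioritized.

```python
# Lower number = higher priority. Only "Academic first" comes from the
# source; the News-before-Other ordering is an assumption.
SOURCE_PRIORITY = {"Academic": 0, "News": 1, "Other": 2}

def normalize_title(title):
    """Assumed duplicate key: lowercase title with collapsed whitespace."""
    return " ".join(title.lower().split())

def deduplicate(articles):
    """Keep one article per normalized title, preferring higher-priority sources."""
    best = {}
    for article in articles:
        key = normalize_title(article["title"])
        kept = best.get(key)
        if kept is None or SOURCE_PRIORITY[article["source"]] < SOURCE_PRIORITY[kept["source"]]:
            best[key] = article
    return list(best.values())

# Hypothetical sample records, not real data from the chart.
sample = [
    {"title": "Deep Learning for X", "source": "News"},
    {"title": "deep learning  for x", "source": "Academic"},
    {"title": "Another Study", "source": "Other"},
]
unique = deduplicate(sample)
print(len(sample), "->", len(unique), "unique articles")
```

In this sample the News and Academic records collapse into one entry, and the Academic copy is the one retained.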

Raw search results by source before duplicate removal
25.6% removed
2,295 duplicates
Legend: Academic, News, Other
8,955 -> 6,660 unique articles