Deduplication Impact

i

What This Shows: Article counts by source before deduplication, showing which search engines contributed most to the raw results.

Why It Matters: Duplicates from overlapping search results would skew analysis. 25.7% of articles were removed as duplicates.

How to Interpret: Bar length shows raw count. The legend shows before->after totals. Academic sources are prioritized when keeping duplicates.

Raw search results by source before duplicate removal
25.7% removed
2,122 duplicates
OPEN ↗
Academic
News
Other
8,257 -> 6,135 unique articles