Data Sources in News Search

Description of the data set offered in News Search

Primer News Search ingests the LexisNexis news data set, a collection of 50,000+ English-language sources from around the world. The knowledge base of events, entities, and relationships is continually updated and has been backfilled with English-language data from 2 years ago to the present day.

In addition to English-language documents, Primer ingests documents in the following languages, which are then translated into English:

  • Spanish - data from 2 years ago - present

  • Russian - data from Sept. 4, 2024 - present

  • Arabic - data from Sept. 4, 2024 - present

  • French - data from Oct. 26, 2024 - present

  • Malay - data from Oct. 26, 2024 - present

  • Indonesian - data from Oct. 26, 2024 - present

  • Chinese (both Traditional and Simplified) - data from Nov. 14, 2024 - present

  • Japanese - data from Oct. 30, 2025 - present

To request that additional data sources or languages be added to our ingestion pipeline, please reach out to [email protected]
