Primer News Search ingests the LexisNexis news data set, a collection of 50,000+ english language sources from all over the world. The knowledge base of events, entities, and relationships is continually updating and has been backfilled with English-language data going back 2 years to present day.
In addition to English language documents, Primer ingests foreign language documents from the following languages, that are then translated into English:
Spanish - data from 2 years back to present day
Russian - data from Sept. 4, 2024 - present
Arabic - data from Sept. 4, 2024 - present
French - data from Oct 26, 2024 - present
Malay - data from Oct 26, 2024 - present
Indonesian - data from Oct 26, 2024 - present
Chinese (Both Traditional and Simplified) - data from Nov 14, 2024 - present
Japanese - data from Oct 30, 2025 - present
To get additional data sources and languages added to our ingestion pipeline, please reach out to [email protected]
