Familiarize yourself with the NLP features available in News API v3
Component | Description | Plan Requirement |
---|---|---|
Theme | General topic or category of the article | v3_nlp |
Summary | Concise overview of the article’s content | v3_nlp |
Sentiment | Separate scores for title and content sentiment | v3_nlp |
Named Entities | Identified persons, organizations, locations, and miscellaneous entities | v3_nlp |
Translations | English translations of non-English content (one-way only): title, content, summary, named entities | v3_nlp |
IPTC Tags | Standardized news category tags | v3_nlp_iptc_tags |
IAB Tags | Content categories for digital advertising | v3_nlp_iptc_tags |
Custom Tags | Organization-specific classification system | All v3 NLP plans |
Embeddings | 1024-dimensional vector representation for semantic similarity | v3_nlp_embeddings |
include_nlp_data
(boolean): Set to true
to include the NLP object for each
article in response.has_nlp
(boolean): Set to true
to filter the results to only articles with
available NLP data.include_translation_fields
(boolean): Set to true
to include translation
fields in the response.null
if specific analyses
were not performed on the article. The full data is available for articles in
English and Arabic only. Translation features are only available with the NLP
plan or higher.include_nlp_data=true, has_nlp=false
: Returns all matching articles with the
NLP object included in each. The completeness of NLP data varies by language.include_nlp_data=true, has_nlp=true
: Returns only articles processed with
NLP. This combination filters out many articles in languages other than
English and Arabic.include_nlp_data=false
: The NLP object is not included in the response,
regardless of the has_nlp
value.include_translation_fields=true
: Includes translation fields
(title_translated_en
and content_translated_en
) in the response.Feature | English & Arabic | Other Languages | Coverage |
---|---|---|---|
Theme classification | ✓ | ✓ (limited) | 10% of non-English/Arabic articles |
Summary | ✓ | ✓ (limited) | 10% of non-English/Arabic articles |
Sentiment analysis | ✓ | ✗ | 100% of English/Arabic articles |
Named entity recognition | ✓ | ✓ | 100% of English/Arabic articles and all non-English articles via translations |
Content tags | ✓ | ✓ (limited) | 10% of non-English/Arabic articles |
Vector embeddings | ✓ | ✓ | Nearly 100% of all articles |
Clustering | ✓ | ✓ | All articles |
Deduplication | ✓ | ✓ | All articles |
Translations to English | ✗ | ✓ | All non-English articles |
Search with English translations | ✗ | ✓ | All non-English articles |
has_nlp=true
substantially reduces the result set.Response example
Business
Economics
Entertainment
Finance
Health
Politics
Science
Sports
Tech
Crime
Financial Crime
Lifestyle
Automotive
Travel
Weather
General
theme
and not_theme
parameters to filter articles based on their
classified themes:
theme
(string): Includes articles matching the specified theme(s).not_theme
(string): Excludes articles matching the specified theme(s).search_in
parameter:
climate change
within article summaries, potentially
yielding more relevant results than searching the full content.
title_sentiment_min
and title_sentiment_max
(float): Filter by title
sentimentcontent_sentiment_min
and content_sentiment_max
(float): Filter by content
sentimentPER_entity_name
: Person namesORG_entity_name
: Organization namesLOC_entity_name
: Location namesMISC_entity_name
: Miscellaneous entities (events, nationalities, products,
works of art, etc.)NER Type | Language Coverage | Response Fields |
---|---|---|
Original Content | 100% English, 100% Arabic, ~10% other languages | nlp.ner_PER , nlp.ner_ORG , nlp.ner_LOC , nlp.ner_MISC |
Translation-Based | 100% all languages (via English translations) | nlp.translation_ner_PER , nlp.translation_ner_ORG , nlp.translation_ner_LOC , nlp.translation_ner_MISC |
AND
, OR
, NOT
), proximity
search with NEAR
, and count-based filtering.
To learn more about NER, see
How to search by entity.
search_in
parameter with translation options:
title_translated
: Search in English translations of titlescontent_translated
: Search in English translations of contentsummary_translated
: Search in summaries of English translationstitle_content_translated
: Search in both English translated titles and
contenttitle_translated_en
: English translation of the article titlecontent_translated_en
: English translation of the article contentnlp.summary_translated
: Brief AI-generated summary of the English
translationnlp.translation_ner_PER
: Person entities extracted from English translationsnlp.translation_ner_ORG
: Organization entities extracted from English
translationsnlp.translation_ner_LOC
: Location entities extracted from English
translationsnlp.translation_ner_MISC
: Miscellaneous entities extracted from English
translationsv3_nlp_iptc_tags
plan. Custom tags are developed
upon request and are available in all NLP plans.
iptc_tags
(string): Includes articles with specified IPTC tags.not_iptc_tags
(string): Excludes articles with specified IPTC tags.20000002
encodes arts and entertainment.
For a complete IPTC Media Topic NewsCodes list, visit the
IPTC website.
iab_tags
(string): Includes articles with specified IAB tags.not_iab_tags
(string): Excludes articles with specified IAB tags.Business
or
Investing
but not Personal Finance
.
For more information on IAB Content Taxonomy, visit the
IAB Tech Lab website.
custom_tags
parameter
following this pattern:
"custom_tags.taxonomy": "Tag1,Tag2,Tag3"
,taxonomy
is your taxonomy name and Tag1,Tag2,Tag3
are specific tags.
To specify multiple tags:
GET
requests, use a comma-separated string.POST
requests, use a comma-separated string or an array of strings.v3_nlp_embeddings
plan, each
article is processed through the
multilingual-e5-large model
to generate its vector representation.
The embedding is available in the new_embedding
field as an array of 1024
numbers. Here’s an example of how it appears in the API response:
Application | Description | Example use case |
---|---|---|
Brand Monitoring | Track mentions, analyze sentiment and identify influencers. | A tech company monitoring public perception of their latest product launch. |
Competitive Intelligence | Monitor competitors’ activities and public perception. | An automotive manufacturer tracking mentions of competitors’ electric vehicle initiatives. |
Market Research | Analyze trends, consumer sentiment, and emerging topics. | A financial services firm identifying emerging fintech trends. |
Political Analysis | Track political figures and analyze public opinion. | A political campaign monitoring sentiment around key policy issues. |
Financial Analysis | Monitor market sentiment and track company mentions. | An investment firm analyzing sentiment around potential acquisition targets. |
Academic Research | Conduct large-scale analysis of media coverage. | A researcher studying media bias in climate change reporting. |
Content Curation | Automatically filter and categorize news content. | A news aggregator app personalizing content for users based on interests. |
Trend Forecasting | Identify emerging trends across industries. | A consulting firm predicting future technology adoption trends. |