Social/Behavioral

News Consumption Data

Buy and sell news consumption data data. Which stories get read, shared, and debated - by political leaning, age, and platform. Media companies make editorial decisions blind without this.

PDFExcelXMLLAS

No listings currently in the marketplace for News Consumption Data.

Find Me This Data →

Overview

What Is News Consumption Data?

News consumption data tracks which stories readers click, share, and engage with across digital platforms. This behavioral dataset captures implicit feedback through browsing patterns, clicks, and reading time, revealing what content resonates with audiences. Publishers use this data to understand audience preferences, optimize editorial decisions, and improve content recommendations. The data can segment consumption patterns by user demographics, device type, location, and platform, providing insights into how different audience segments engage with news topics and formats.

Market Data

1,000,000

MIND Dataset Users

Source: arXiv

24.2 million

MIND Dataset Clicks

Source: arXiv

1.1 million

EB-NeRD Dataset Users

Source: arXiv

38 million

EB-NeRD Dataset Clicks

Source: arXiv

3.1 million

Adressa Dataset Users

Source: arXiv

Who Uses This Data

What AI models do with it.do with it.

01

News Recommendation Systems

Publishers and aggregators build machine learning models to predict which stories individual users will click or read, personalizing the news feed experience.

02

Editorial Strategy & Content Optimization

Editorial teams analyze consumption patterns to understand which topics, formats, and angles drive engagement, informing coverage decisions and resource allocation.

03

Reader Preference Analysis

Media organizations segment audiences by demographics, location, and device to understand how different reader groups consume news and what content gaps exist.

04

AI-Powered News Customization

News platforms use consumption data to test and deploy customization features that let readers specify their preferred topics and presentation formats.

What Can You Earn?

What it's worth.worth.

Research/Academic License

Varies

Open datasets like EB-NeRD and MIND are publicly available under CC BY licenses for research and non-commercial use.

Commercial Data Access

Varies

Most proprietary news consumption datasets from major publishers and aggregators are not publicly available; pricing negotiated case-by-case.

Anonymized Click Stream

Varies

Aggregated consumption metrics including user-id, article-id, event-time, and read-time are the standard data elements publishers monetize.

What Buyers Expect

What makes it valuable.valuable.

01

Rich User Context

Data should include user identifiers, session boundaries, device type, OS, location, and subscription status to enable segmentation and personalization.

02

Article Metadata

Complete content information including title, abstract, body text, category, entities, URL, publish time, and subcategories enables better recommendation and analysis.

03

Behavioral Timestamps

Precise event-time and read-time metrics are essential for modeling user interest evolution and detecting temporal patterns in news consumption.

04

Scale & Representativeness

Datasets should contain millions of users and clicks across diverse publishers or aggregators to ensure findings generalize beyond a single editorial context.

Companies Active Here

Who's buying.buying.

MSN News

Major news aggregator operating the MIND dataset containing 1 million users and 24+ million clicks; aggregates consumption data across many publishers.

JP/Politikens Media Group

Danish publisher that created and maintains the EB-NeRD large-scale news recommendation dataset with 1.1 million users and 38 million clicks.

Adressa (Schibsted)

Norwegian digital news publisher operating the Adressa dataset with 3.1 million users and 27+ million clicks, tracking subscription status and device behavior.

FAQ

Common questions.questions.

How do news publishers currently use consumption data?

Publishers use news consumption data to build personalized recommendation systems, understand audience interests through implicit feedback (clicks and reads), segment readers by geography and device, and make editorial decisions. However, reporters and editors often cannot predict what readers will care about before publishing, making consumption data critical post-publication feedback.

What's the difference between news aggregator data and publisher data?

News aggregators like MSN News compile stories from many sources and capture broad user behavior, while digital publishers own their own content and reader relationships. Aggregator datasets (like MIND) and publisher datasets (like EB-NeRD and Adressa) differ in language, user count, content richness, and features. Insights from aggregators don't always transfer directly to individual publishers with different content strategies.

What data points are typically available in news consumption datasets?

Standard data includes user identifiers, article IDs, click timestamps, and article metadata (title, abstract, category). Enhanced datasets add read-time, publish-time, session boundaries, user location, device type, OS, and subscription status. Article text and entities enable deeper content analysis beyond recommendations.

How is news consumption data collected?

Data is collected through browser tracking (implicit feedback from clicks, page views, and time spent reading) rather than explicit user ratings. Users are identified by anonymous IDs tied to sessions, devices, or accounts. Major datasets are maintained by news publishers and aggregators at scale—from 1 to 3+ million users—to ensure representative insights.

Sell yournews consumptiondata.

If your company generates news consumption data, AI companies are actively looking for it. We handle pricing, compliance, and buyer matching.

Request Valuation