Datasets Use Cases Research

What Is Alternative Data? Types, Examples, and How Investors Use It

Alternative data is any information used for investment research that comes from outside traditional financial sources: earnings reports, balance sheets, guidance, and analyst estimates. The category is broad. It includes everything from satellite imagery of parking lots to search volume trends, credit card transactions, TikTok hashtag counts, web traffic, and news sentiment scores.

The core idea is simple: financial statements describe what already happened. Alternative data can describe what is happening now, or what is about to happen. For investors trying to anticipate earnings, track demand, or get ahead of shifts in consumer behavior, that time advantage is the value proposition.


Why alternative data exists as a category

Investment research has always sought non-public information advantages. What changed over the last decade is that the internet and digital commerce created an enormous volume of behavioral signals that are technically public but practically hard to aggregate, normalize, and connect to investable names. That created a market for data vendors who do the collection, processing, and mapping work on behalf of institutional investors.

The other driver is quant and systematic investing. Factor-based strategies and quantitative funds need data that can be ingested systematically, updated at high frequency, and combined with other datasets. Traditional financial data alone does not have enough variation or lead time to differentiate signals. Alternative data fills that gap.


The main types of alternative data

1. Search and intent data

Search volume from Google, Amazon, YouTube, and similar platforms reflects what consumers are actively looking for. A rising trend in branded search often precedes demand growth. Falling search share within a category can indicate competitive pressure. Search data is one of the most widely used alternative data types because it is available historically, it normalizes cleanly across companies and categories, and it maps naturally to investable names.

Examples: Google Search volume for a brand name, Amazon product search trends, YouTube video search for a topic, Google Shopping interest for a product category.

2. Social and engagement data

Data from social platforms captures awareness, cultural relevance, and word-of-mouth momentum. Unlike search, which reflects intent, social engagement reflects discovery and sharing. The two often move together but can diverge meaningfully, and that divergence is often informative.

Examples: TikTok hashtag volume for a brand or product, Reddit discussion volume and sentiment for a company, Instagram engagement trends.

3. News sentiment and volume

Structured analysis of news articles, earnings call transcripts, regulatory filings, and web content. Sentiment scores, entity mentions, and topic classification allow investors to track how the narrative around a company or sector is evolving, independent of what prices are doing.

Examples: Daily sentiment scores for a company based on news coverage, earnings call tone analysis, volume of negative vs. positive coverage over time.

4. Web and app traffic

Estimates of how many people visit a website or use an app, derived from panel data, ISP data, or similar sources. Web traffic is a proxy for demand and engagement, particularly useful for digital-first businesses or companies with significant e-commerce exposure.

Examples: Monthly unique visitors to a retailer's website, app download estimates and active user trends, time-on-site metrics.

5. Transaction and card data

Aggregated and anonymized credit and debit card spend, or receipt-level transaction data, used to estimate consumer spending at specific companies or categories. Often considered the closest alternative data to actual reported revenue.

Examples: Weekly same-store sales estimates derived from card spend, category spend trends, geographic spend comparisons.

6. Location and foot traffic data

Aggregated mobile device location data used to estimate how many people visit physical locations: stores, restaurants, offices, factories. Particularly relevant for retail, restaurants, commercial real estate, and industrial names.

Examples: Monthly foot traffic index for a restaurant chain, store visit counts relative to the prior year, parking lot occupancy estimates.

7. Satellite and geospatial data

Satellite imagery analyzed to count cars in parking lots, measure factory activity, track oil tank levels, or estimate agricultural yields. One of the earliest and most well-known alternative data types.

Examples: Parking lot occupancy at big-box retailers, ship tracking data for supply chain analysis, crop yield estimates from spectral imaging.

8. Wikipedia and reference traffic

Page views for Wikipedia articles about companies, products, or topics. A surprisingly clean proxy for general public awareness, used as a leading indicator of earnings surprises in academic research.

Examples: Spike in Wikipedia views for a company ahead of a news event, rising views for a product category entering mainstream awareness.


How institutional investors use it

Pre-earnings analysis. The most common use case. Analysts and portfolio managers review alternative data signals in the weeks before an earnings report to assess whether the quarter is tracking above or below consensus. Search trends, social engagement, web traffic, and card data are all used in different combinations depending on the company type.

Thematic and sector research. Identifying which sectors, categories, or themes are gaining or losing consumer attention. Alternative data can surface a thematic shift (e.g. rising interest in a product category, declining engagement with a specific format) months before it is visible in financial results.

Idea generation. Screening for names where behavioral signals are moving in a direction that consensus has not yet reflected. Search volume accelerating for a mid-cap brand with thin analyst coverage is an example of a signal that could trigger deeper fundamental research.

Risk monitoring. Using alternative data as an early warning system. Brand deterioration, declining web traffic, weakening search trends, and rising negative sentiment can each be tracked continuously and used to flag positions before problems show up in earnings.

Competitive intelligence. Comparing signals across companies in the same sector: who is gaining search share, whose web traffic is growing faster, whose social engagement is outpacing the category.


What to look for in alternative data

Not all alternative data is the same quality. For institutional use, the meaningful criteria are:

  • Historical depth. You need years of history to backtest, normalize, and understand how a signal behaves in different environments.
  • Consistent methodology. The way a signal is calculated should not change over time. If the methodology shifts, historical comparisons break down.
  • Entity mapping. The data needs to be connectable to specific tickers, companies, or sectors, not just raw keywords or addresses.
  • Update frequency. Weekly or daily data is usable for most investment processes. Monthly data limits the lead-time advantage.
  • Normalization. Absolute volume estimates and normalized indices that are comparable across companies and time periods are more useful than raw counts that reflect platform size changes.

Where Paradox Intelligence fits

Paradox Intelligence provides normalized, historically consistent alternative data across search (Google, Amazon, YouTube, Google Shopping), social (TikTok, Reddit), news sentiment, web traffic, and Wikipedia, all mapped to companies and themes. Data is available via platform, REST API, and MCP server for AI-integrated workflows. For the full dataset catalog, see Datasets.


For a deeper look at specific source types, see 5 Alternative Data Sources Hedge Funds Use Most and Research.


- Find Your Plan

This post is for institutional investors and research professionals. It is not investment advice.

BUILT BY INVESTORS, FOR INVESTORS