Roadmap

Upcoming enhancements to the Autobound Signal Database, including new data sources, schema improvements, and platform capabilities.

This roadmap outlines planned enhancements to the Signal Database. Timelines are estimates and subject to change based on customer feedback.

📬

Feedback welcome: Have a signal you'd like to see or a schema improvement that would help? Contact [email protected]


Quick Navigation


Now (January–February 2026)

🔧 Schema Standardization

Competitor Domain Resolution

Currently, competitors_mentioned returns company names only. We're adding competitors_mentioned_domains to enable direct joins:

{
  "data": {
    "competitors_mentioned": ["Salesforce", "HubSpot"],
    "competitors_mentioned_domains": ["salesforce.com", "hubspot.com"]
  }
}

Affected signals: 10-K, 10-Q, 8-K, 20-F, 6-K, Earnings Transcript, Reddit, News

Moderation Scores on Social Signals

Adding moderation_score (0.0–1.0, higher = safer) to all user-generated content signals:

{
  "data": {
    "moderation_score": 0.95,
    "summary": "..."
  }
}

Affected signals: Reddit, LinkedIn Posts, LinkedIn Comments, Twitter/X, YouTube, G2 Reviews, Glassdoor Reviews

Confidence Score Standardization

Unifying confidence scoring across all signals to a consistent 0.0–1.0 scale, with both numeric and categorical values:

{
  "data": {
    "confidence": "high",
    "confidence_score": 0.92
  }
}
EnhancementStatusImpact
Competitor domains in all payloads🟢 In ProgressEnables joins on competitors_mentioned
Moderation scores on social signals🟢 In ProgressSafety filtering for Reddit, LinkedIn, YouTube
Confidence scores standardized🟢 In ProgressUnified 0.0–1.0 scale across all signals

📦 Delivery Improvements

Manifest Files for S3/GCS Drops

After each data drop, we'll write a manifest file to /manifest/YYYY-MM-DD_success.json:

{
  "drop_date": "2026-01-15T00:00:00Z",
  "buckets_updated": ["hiring-velocity", "job-changes", "news"],
  "record_counts": {
    "hiring-velocity": 145000,
    "job-changes": 89000,
    "news": 12500
  }
}

Status: Live (Beta) — Manifest files are now being generated. Contact your account manager for access.

This enables event-driven pipelines that trigger on manifest file creation.

Analytics Dashboard

We're building a self-service analytics dashboard for Signal Database customers:

  • Record counts by signal type — See total signals available across your subscribed categories
  • Historical trend visualization — Track signal volume over time to identify patterns
  • Geographic filtering — Visualize and filter data by company headquarters region
  • Industry segmentation — Analyze signals by industry vertical
  • Coverage analysis — Upload your TAM and see hit rates across signal types
  • CSV/Excel exports — Download filtered data for offline analysis
EnhancementStatusImpact
Manifest files for S3/GCS drops🟢 Live (Beta)Enables event-driven pipelines
Larger sample files on README🟢 In ProgressBetter evaluation data
Analytics dashboard🟢 In ProgressSelf-service coverage analysis

See Delivery for current bucket structure details.


⚡ Refresh Frequency Improvements

Our goal is to move all signal types to weekly refresh, then prioritize high-value signals for daily delivery.

Signal TypeCurrentTargetETA
Job ChangesMonthlyWeeklyFeb 2026
LinkedIn PostsMonthlyBiweeklyFeb 2026
LinkedIn CommentsMonthlyBiweeklyFeb 2026
NewsMonthlyWeeklyFeb 2026
10-K FilingsQuarterlyWeeklyFeb 2026
10-Q FilingsQuarterlyWeeklyFeb 2026
8-K FilingsQuarterlyWeeklyFeb 2026
Earnings TranscriptsQuarterlyWeeklyFeb 2026
20-F FilingsQuarterlyWeeklyFeb 2026
6-K FilingsQuarterlyWeeklyFeb 2026

Daily refresh priority: Once weekly refresh is achieved across all categories, we will prioritize daily delivery for high-velocity signals like LinkedIn Posts and News.

See Signal Catalog for current refresh frequencies.


📊 Monitoring Pool Expansion

We're significantly expanding the number of companies and contacts in our monitoring pool across all signal categories.

MilestoneTimelineGrowth
Phase 1Mid-February 2026+25–50% records per category
Phase 2Q2 20262× current size (doubling)

This expansion improves coverage for:

  • Mid-market and SMB companies
  • International markets (EMEA, APAC)
  • Emerging industries and verticals

See Signal Catalog for current coverage statistics by signal type.


🔗 Entity Resolution Improvements

We're actively improving the resolution quality of core company and contact fields included with each signal.

FieldCurrentTarget
company.linkedin_url~85% coverage95%+
company.domain~95% coverage99%+
contact.email~80% coverage90%+
contact.linkedin_url~90% coverage98%+

This work is actively in progress and will roll out incrementally across signal types.


🏭 Industry Normalization

We're standardizing the industries field across all signals. Currently, industry values vary by source (e.g., SEC filings use different industry classifications than LinkedIn-derived data).

Target: A normalized taxonomy of ~150 industries applied consistently across all signal types, with mapping to common standards (NAICS, SIC, LinkedIn industries).

This will enable more reliable industry-based filtering and segmentation.


Next (March–April 2026)

🕸️ Company Connections Flat File

A new bulk data product: the complete relationship graph for all companies in our database.

View Schema Example
{
  "company_domain": "stripe.com",
  "connections": [
    {
      "connected_domain": "shopify.com",
      "connection_type": "customer",
      "confidence": 0.92,
      "first_seen": "2023-01-15",
      "last_seen": "2026-01-10",
      "source_url": "https://stripe.com/customers/shopify"
    },
    {
      "connected_domain": "adyen.com",
      "connection_type": "competitor",
      "confidence": 0.88,
      "first_seen": "2022-06-01",
      "last_seen": "2026-01-10",
      "source_url": "https://g2.com/compare/stripe-vs-adyen"
    },
    {
      "connected_domain": "sequoiacap.com",
      "connection_type": "investor",
      "confidence": 0.99,
      "first_seen": "2021-03-01",
      "last_seen": "2026-01-10",
      "source_url": "https://www.sequoiacap.com/companies/stripe"
    }
  ]
}

Connection types: competitor · customer · vendor · partner · investor

Enables:

  • Resolving competitors_mentioned in earnings transcripts to full company records
  • Building competitive conquest campaigns
  • Identifying shared investors or vendors

See Competitive Companies for current API-based access.


🤝 Relationship Classification on LinkedIn

For LinkedIn comments, we'll classify the relationship between commenter and poster:

View Schema Example
{
  "signal_type": "linkedin-comment",
  "data": {
    "relationship_type": "competitor",
    "relationship_confidence": 0.85,
    "commenter_company_domain": "hubspot.com",
    "poster_company_domain": "salesforce.com"
  }
}

Relationship types: colleague · competitor · vendor · customer · friend · recruiter · unknown

This helps filter high-intent engagement from noise.


📚 Historical Backfill (12 Months)

Expanding historical access for all signal types:

Signal TypeCurrent LookbackTarget
SEC Filings (10-K, 10-Q, 8-K)12 months✅ 12 months
News12 months✅ 12 months
Hiring Trends3 months12 months
Glassdoor3 months12 months
LinkedIn Posts1 month12 months
Reddit1 month12 months

Enables:

  • Trend analysis ("hiring velocity ↑ 3 consecutive months")
  • Predictive model training
  • Better sampling for coverage analysis

🏷️ Cross-Signal Subtype Standardization

We're standardizing subtypes across signal sources. The same business event should have the same subtype regardless of source.

Leadership Changes
Current (varies by source)Standardized
ceoChange (10-K)leadershipChange_ceo
hires + job_title_tags (News)leadershipChange_ceo
leadershipChange (8-K)leadershipChange_ceo
Funding Events
Current (varies by source)Standardized
funding (News)funding_seriesA, funding_seriesB, etc.
receives_financing (News)funding_seriesA, funding_seriesB, etc.
Acquisition Events
Current (varies by source)Standardized
acquisitionAnnounced (10-K)acquisition_announced
acquisitionCompleted (10-K)acquisition_completed
acquires (News)acquisition_completed
acquisitionActivity (Earnings)acquisition_announced

This enables cross-signal deduplication and unified filtering.


Later (Q2 2026)

🆕 New Data Sources

SourceEntityDescriptionETA
CapterraCompanyProduct reviews and ratingsQ2 2026
TrustRadiusCompanyEnterprise software reviewsQ2 2026
Gartner Peer InsightsCompanyEnterprise validation and reviewsQ2 2026
PeerSpotCompanyIT/security-focused reviewsQ3 2026
QuoraCompany/ContactQ&A discussions and expertise signalsQ2 2026
Podcast TranscriptsContactExecutive podcast appearancesQ2 2026
Conference TalksContactSpeaking engagements and presentationsQ3 2026

New review sources will follow the G2 Reviews schema pattern and be unified under a product-reviews signal type with a source field for filtering.


📰 News Signal Enrichment

Expanding the metadata extracted from news events to match the depth of our SEC filing and Reddit signals:

New Fields for News Signals
FieldDescription
competitors_mentionedCompetitor companies mentioned in the article
competitors_mentioned_domainsDomains for direct joins
technologies_mentionedTechnologies/products referenced
vendors_mentionedVendor companies mentioned
customers_mentionedCustomer companies mentioned
use_of_fundsFor funding events: stated use of funds
Enhanced Funding Schema Example
{
  "signal_type": "news",
  "signal_subtype": "funding",
  "data": {
    "summary": "Acme Corp raises $50M Series B",
    "amount": "$50M",
    "amount_normalized": 50000000,
    "financing_type": "Series B",
    "investors": ["Sequoia Capital", "Andreessen Horowitz"],
    "investor_domains": ["sequoiacap.com", "a16z.com"],
    "lead_investor": "Sequoia Capital",
    "use_of_funds": "Expand AI infrastructure and grow engineering team",
    "competitors_mentioned": ["Competitor Corp"],
    "competitors_mentioned_domains": ["competitor.com"],
    "technologies_mentioned": ["AI", "Machine Learning"]
  }
}

This brings news signals to parity with the rich metadata already available in 10-K, Earnings Transcripts, and Reddit signals.


📋 Job Posting Content Analysis

A new signal type that extracts insights from the content of job postings, not just hiring velocity:

View Schema Example
{
  "signal_type": "job-posting-content",
  "signal_subtype": "tech_stack_signal",
  "association": "company",
  "company": {
    "name": "Acme Corp",
    "domain": "acme.com"
  },
  "data": {
    "summary": "Hiring for Snowflake and dbt expertise across 5 open roles",
    "technologies_mentioned": ["Snowflake", "dbt", "Airflow", "Python"],
    "tools_mentioned": ["Salesforce", "HubSpot", "Tableau"],
    "pain_points_mentioned": ["scaling data infrastructure", "real-time analytics"],
    "departments": ["engineering", "data"],
    "seniority_levels": ["senior", "staff"],
    "role_count": 5,
    "compensation_range": {
      "low": 150000,
      "high": 200000,
      "currency": "USD"
    }
  }
}

Available subtypes:

SubtypeDescription
tech_stack_signalTechnologies and tools mentioned in job descriptions
pain_point_signalChallenges and problems mentioned ("scaling our infrastructure")
budget_signalCompensation ranges indicating budget levels
team_buildout_signalEvidence of building new teams or functions

Why it matters: If a company is hiring for "Snowflake engineers," that's a technographic signal. If they mention "scaling challenges," that's a pain point.


🎤 Conference & Event Signals

Track company participation in industry events:

View Schema Example
{
  "signal_type": "event",
  "signal_subtype": "conference_sponsoring",
  "association": "company",
  "company": {
    "name": "Acme Corp",
    "domain": "acme.com"
  },
  "data": {
    "summary": "Acme Corp is a Gold sponsor at SaaStr Annual 2026",
    "event_name": "SaaStr Annual 2026",
    "event_date": "2026-09-15",
    "event_location": "San Francisco, CA",
    "participation_type": "sponsor",
    "sponsorship_level": "Gold",
    "source_url": "https://saastrannual.com/sponsors"
  }
}

Available subtypes:

SubtypeDescription
conference_attendingCompany registered/attending an event
conference_sponsoringCompany sponsoring an event
conference_speakingExecutive speaking at event
webinar_hostingCompany hosting a webinar

👔 Executive Movement Graph

Beyond individual job changes, track the executive network:

View Schema Example
{
  "company_domain": "acme.com",
  "executive_connections": {
    "alumni_sources": [
      {
        "company_domain": "google.com",
        "executive_count": 3,
        "titles": ["CTO", "VP Engineering", "Director of Product"]
      }
    ],
    "alumni_destinations": [
      {
        "company_domain": "startup.io",
        "executive_count": 2,
        "titles": ["CEO", "COO"]
      }
    ],
    "board_overlap": [
      {
        "company_domain": "portfolio-co.com",
        "shared_board_members": ["Jane Smith"]
      }
    ]
  }
}

Enables:

  • "Their new CTO came from your customer Google" — warm intro signal
  • Track where departing executives go
  • Identify companies with shared board members or investors

📜 Regulatory & Compliance Signals

New subtypes for SEC filings focused on compliance and regulatory mentions:

SubtypeDescription
gdpr_complianceGDPR compliance discussed or achieved
soc2_certificationSOC2 mentioned or certification achieved
hipaa_complianceHealthcare compliance signals
fedramp_pursuitGovernment compliance pursuit
ai_governanceAI policy or governance framework mentioned
data_privacy_initiativeData privacy investments or initiatives

Why it matters: Compliance is a major budget driver. "Company mentioned SOC2 in their 10-K" signals security spending.


🔌 API-First Signal Access

Real-time signal lookup via REST API:

curl -X GET "https://api.autobound.ai/signals/v1/company/stripe.com" \
  -H "X-API-KEY: your-key" \
  -d "signal_types=hiring-velocity,news,earnings" \
  -d "max_age_days=30"

Features:

  • Fetch signals for a specific company or contact
  • Filter by signal type, subtype, age
  • Webhook subscriptions for new signals

This complements bulk delivery for point-of-action use cases.


🔄 Cross-Signal Deduplication

Improved deduplication across signal sources:

EventSourcesDedup Logic
Funding roundNews, 10-K, EarningsSame company + amount + 30-day window
Leadership changeNews, 10-K, 8-K, Job ChangeSame company + person + 30-day window
AcquisitionNews, 10-K, 8-K, EarningsSame acquirer + target + 60-day window

Deduplicated signals will include a related_signals array linking to other sources.


Platform Features (H2 2026)

Canonical Entity IDs Schema
{
  "company": {
    "autobound_id": "ab_company_12345",
    "domain": "stripe.com",
    "linkedin_url": "linkedin.com/company/stripe"
  }
}

Enables:

  • Consistent joins across signal types
  • Historical tracking of entity changes
  • Simplified data modeling

Schema Standardization Summary

Fields Being Standardized

FieldCurrent StateTarget State
relevance0–1 or 0–100 depending on signal0.0–1.0 everywhere
confidenceString or floatBoth: confidence (string) + confidence_score (float)
sentimentPresent on some signalsPresent on all signals
competitors_mentionedNames onlyNames + domains
technologies_mentionedInconsistentStandardized taxonomy
source_urlVarious field namessource_url everywhere

New Standard Fields (All Signals)

FieldTypeDescription
moderation_scorefloatContent safety (0.0–1.0, higher = safer)
freshness_hoursintegerHours since underlying event occurred
evidence_countintegerNumber of sources supporting this signal

Subtype Consolidation & Expansion

We're both consolidating fragmented subtypes and launching many new ones.

Consolidation

Current State: 400+ subtypes across 24 categories with inconsistencies:

  • Similar subtypes with different names (ceoChange vs leadershipChange vs hires)
  • Legacy subtypes (talentRetentionConcerns vs glassdoorTalentRetentionConcerns)
  • Inconsistent casing (hiringVelocity vs hiring_velocity)

Target State: Consistent naming convention → category_specificEvent

Examples: leadership_ceo_change · funding_series_a · acquisition_completed

Legacy subtypes will be supported via aliases for backward compatibility.

New Subtypes Coming

Earnings Transcripts
New SubtypeDescription
guidance_beatCompany beat earnings guidance
guidance_missCompany missed earnings guidance
margin_expansionProfit margins expanding
margin_compressionProfit margins compressing
10-K / 10-Q Filings
New SubtypeDescription
esg_commitmentESG or sustainability commitment
supply_chain_riskSupply chain risk disclosed
pricing_powerPricing power mentioned
customer_concentration_riskRevenue concentration risk
News
New SubtypeDescription
executive_speakingExecutive speaking engagement
conference_presentationCompany presenting at conference
analyst_upgradeAnalyst upgraded stock rating
analyst_downgradeAnalyst downgraded stock rating
Hiring
New SubtypeDescription
executive_searchExecutive-level search underway
team_buildoutBuilding new team/function
replacement_hireBackfilling departed employee
LinkedIn
New SubtypeDescription
thought_leadershipThought leadership content
product_announcementProduct/feature announcement
hiring_post"We're hiring" post
event_promotionEvent or webinar promotion
Reddit
New SubtypeDescription
vendor_evaluationActively evaluating vendors
implementation_questionImplementation help request
feature_requestFeature request or wishlist
migration_discussionDiscussing switching providers

New subtypes will be documented in the Signal Catalog as they launch.


Stay Updated

Track all schema changes, new signals, and API updates in our Changelog.


Questions?