Roadmap
Upcoming enhancements to the Autobound Signal Database, including new data sources, schema improvements, and platform capabilities.
This roadmap outlines planned enhancements to the Signal Database. Timelines are estimates and subject to change based on customer feedback.
Feedback welcome: Have a signal you'd like to see or a schema improvement that would help? Contact [email protected]
Quick Navigation
Now (January–February 2026)
🔧 Schema Standardization
Competitor Domain Resolution
Currently, competitors_mentioned returns company names only. We're adding competitors_mentioned_domains to enable direct joins:
{
"data": {
"competitors_mentioned": ["Salesforce", "HubSpot"],
"competitors_mentioned_domains": ["salesforce.com", "hubspot.com"]
}
}Affected signals: 10-K, 10-Q, 8-K, 20-F, 6-K, Earnings Transcript, Reddit, News
Moderation Scores on Social Signals
Adding moderation_score (0.0–1.0, higher = safer) to all user-generated content signals:
{
"data": {
"moderation_score": 0.95,
"summary": "..."
}
}Affected signals: Reddit, LinkedIn Posts, LinkedIn Comments, Twitter/X, YouTube, G2 Reviews, Glassdoor Reviews
Confidence Score Standardization
Unifying confidence scoring across all signals to a consistent 0.0–1.0 scale, with both numeric and categorical values:
{
"data": {
"confidence": "high",
"confidence_score": 0.92
}
}| Enhancement | Status | Impact |
|---|---|---|
| Competitor domains in all payloads | 🟢 In Progress | Enables joins on competitors_mentioned |
| Moderation scores on social signals | 🟢 In Progress | Safety filtering for Reddit, LinkedIn, YouTube |
| Confidence scores standardized | 🟢 In Progress | Unified 0.0–1.0 scale across all signals |
📦 Delivery Improvements
Manifest Files for S3/GCS Drops
After each data drop, we'll write a manifest file to /manifest/YYYY-MM-DD_success.json:
{
"drop_date": "2026-01-15T00:00:00Z",
"buckets_updated": ["hiring-velocity", "job-changes", "news"],
"record_counts": {
"hiring-velocity": 145000,
"job-changes": 89000,
"news": 12500
}
}Status: Live (Beta) — Manifest files are now being generated. Contact your account manager for access.
This enables event-driven pipelines that trigger on manifest file creation.
Analytics Dashboard
We're building a self-service analytics dashboard for Signal Database customers:
- Record counts by signal type — See total signals available across your subscribed categories
- Historical trend visualization — Track signal volume over time to identify patterns
- Geographic filtering — Visualize and filter data by company headquarters region
- Industry segmentation — Analyze signals by industry vertical
- Coverage analysis — Upload your TAM and see hit rates across signal types
- CSV/Excel exports — Download filtered data for offline analysis
| Enhancement | Status | Impact |
|---|---|---|
| Manifest files for S3/GCS drops | 🟢 Live (Beta) | Enables event-driven pipelines |
| Larger sample files on README | 🟢 In Progress | Better evaluation data |
| Analytics dashboard | 🟢 In Progress | Self-service coverage analysis |
See Delivery for current bucket structure details.
⚡ Refresh Frequency Improvements
Our goal is to move all signal types to weekly refresh, then prioritize high-value signals for daily delivery.
| Signal Type | Current | Target | ETA |
|---|---|---|---|
| Job Changes | Monthly | Weekly | Feb 2026 |
| LinkedIn Posts | Monthly | Biweekly | Feb 2026 |
| LinkedIn Comments | Monthly | Biweekly | Feb 2026 |
| News | Monthly | Weekly | Feb 2026 |
| 10-K Filings | Quarterly | Weekly | Feb 2026 |
| 10-Q Filings | Quarterly | Weekly | Feb 2026 |
| 8-K Filings | Quarterly | Weekly | Feb 2026 |
| Earnings Transcripts | Quarterly | Weekly | Feb 2026 |
| 20-F Filings | Quarterly | Weekly | Feb 2026 |
| 6-K Filings | Quarterly | Weekly | Feb 2026 |
Daily refresh priority: Once weekly refresh is achieved across all categories, we will prioritize daily delivery for high-velocity signals like LinkedIn Posts and News.
See Signal Catalog for current refresh frequencies.
📊 Monitoring Pool Expansion
We're significantly expanding the number of companies and contacts in our monitoring pool across all signal categories.
| Milestone | Timeline | Growth |
|---|---|---|
| Phase 1 | Mid-February 2026 | +25–50% records per category |
| Phase 2 | Q2 2026 | 2× current size (doubling) |
This expansion improves coverage for:
- Mid-market and SMB companies
- International markets (EMEA, APAC)
- Emerging industries and verticals
See Signal Catalog for current coverage statistics by signal type.
🔗 Entity Resolution Improvements
We're actively improving the resolution quality of core company and contact fields included with each signal.
| Field | Current | Target |
|---|---|---|
company.linkedin_url | ~85% coverage | 95%+ |
company.domain | ~95% coverage | 99%+ |
contact.email | ~80% coverage | 90%+ |
contact.linkedin_url | ~90% coverage | 98%+ |
This work is actively in progress and will roll out incrementally across signal types.
🏭 Industry Normalization
We're standardizing the industries field across all signals. Currently, industry values vary by source (e.g., SEC filings use different industry classifications than LinkedIn-derived data).
Target: A normalized taxonomy of ~150 industries applied consistently across all signal types, with mapping to common standards (NAICS, SIC, LinkedIn industries).
This will enable more reliable industry-based filtering and segmentation.
Next (March–April 2026)
🕸️ Company Connections Flat File
A new bulk data product: the complete relationship graph for all companies in our database.
View Schema Example
{
"company_domain": "stripe.com",
"connections": [
{
"connected_domain": "shopify.com",
"connection_type": "customer",
"confidence": 0.92,
"first_seen": "2023-01-15",
"last_seen": "2026-01-10",
"source_url": "https://stripe.com/customers/shopify"
},
{
"connected_domain": "adyen.com",
"connection_type": "competitor",
"confidence": 0.88,
"first_seen": "2022-06-01",
"last_seen": "2026-01-10",
"source_url": "https://g2.com/compare/stripe-vs-adyen"
},
{
"connected_domain": "sequoiacap.com",
"connection_type": "investor",
"confidence": 0.99,
"first_seen": "2021-03-01",
"last_seen": "2026-01-10",
"source_url": "https://www.sequoiacap.com/companies/stripe"
}
]
}Connection types: competitor · customer · vendor · partner · investor
Enables:
- Resolving
competitors_mentionedin earnings transcripts to full company records - Building competitive conquest campaigns
- Identifying shared investors or vendors
See Competitive Companies for current API-based access.
🤝 Relationship Classification on LinkedIn
For LinkedIn comments, we'll classify the relationship between commenter and poster:
View Schema Example
{
"signal_type": "linkedin-comment",
"data": {
"relationship_type": "competitor",
"relationship_confidence": 0.85,
"commenter_company_domain": "hubspot.com",
"poster_company_domain": "salesforce.com"
}
}Relationship types: colleague · competitor · vendor · customer · friend · recruiter · unknown
This helps filter high-intent engagement from noise.
📚 Historical Backfill (12 Months)
Expanding historical access for all signal types:
| Signal Type | Current Lookback | Target |
|---|---|---|
| SEC Filings (10-K, 10-Q, 8-K) | 12 months | ✅ 12 months |
| News | 12 months | ✅ 12 months |
| Hiring Trends | 3 months | 12 months |
| Glassdoor | 3 months | 12 months |
| LinkedIn Posts | 1 month | 12 months |
| 1 month | 12 months |
Enables:
- Trend analysis ("hiring velocity ↑ 3 consecutive months")
- Predictive model training
- Better sampling for coverage analysis
🏷️ Cross-Signal Subtype Standardization
We're standardizing subtypes across signal sources. The same business event should have the same subtype regardless of source.
Leadership Changes
| Current (varies by source) | Standardized |
|---|---|
ceoChange (10-K) | leadershipChange_ceo |
hires + job_title_tags (News) | leadershipChange_ceo |
leadershipChange (8-K) | leadershipChange_ceo |
Funding Events
| Current (varies by source) | Standardized |
|---|---|
funding (News) | funding_seriesA, funding_seriesB, etc. |
receives_financing (News) | funding_seriesA, funding_seriesB, etc. |
Acquisition Events
| Current (varies by source) | Standardized |
|---|---|
acquisitionAnnounced (10-K) | acquisition_announced |
acquisitionCompleted (10-K) | acquisition_completed |
acquires (News) | acquisition_completed |
acquisitionActivity (Earnings) | acquisition_announced |
This enables cross-signal deduplication and unified filtering.
Later (Q2 2026)
🆕 New Data Sources
| Source | Entity | Description | ETA |
|---|---|---|---|
| Capterra | Company | Product reviews and ratings | Q2 2026 |
| TrustRadius | Company | Enterprise software reviews | Q2 2026 |
| Gartner Peer Insights | Company | Enterprise validation and reviews | Q2 2026 |
| PeerSpot | Company | IT/security-focused reviews | Q3 2026 |
| Quora | Company/Contact | Q&A discussions and expertise signals | Q2 2026 |
| Podcast Transcripts | Contact | Executive podcast appearances | Q2 2026 |
| Conference Talks | Contact | Speaking engagements and presentations | Q3 2026 |
New review sources will follow the G2 Reviews schema pattern and be unified under a product-reviews signal type with a source field for filtering.
📰 News Signal Enrichment
Expanding the metadata extracted from news events to match the depth of our SEC filing and Reddit signals:
New Fields for News Signals
| Field | Description |
|---|---|
competitors_mentioned | Competitor companies mentioned in the article |
competitors_mentioned_domains | Domains for direct joins |
technologies_mentioned | Technologies/products referenced |
vendors_mentioned | Vendor companies mentioned |
customers_mentioned | Customer companies mentioned |
use_of_funds | For funding events: stated use of funds |
Enhanced Funding Schema Example
{
"signal_type": "news",
"signal_subtype": "funding",
"data": {
"summary": "Acme Corp raises $50M Series B",
"amount": "$50M",
"amount_normalized": 50000000,
"financing_type": "Series B",
"investors": ["Sequoia Capital", "Andreessen Horowitz"],
"investor_domains": ["sequoiacap.com", "a16z.com"],
"lead_investor": "Sequoia Capital",
"use_of_funds": "Expand AI infrastructure and grow engineering team",
"competitors_mentioned": ["Competitor Corp"],
"competitors_mentioned_domains": ["competitor.com"],
"technologies_mentioned": ["AI", "Machine Learning"]
}
}This brings news signals to parity with the rich metadata already available in 10-K, Earnings Transcripts, and Reddit signals.
📋 Job Posting Content Analysis
A new signal type that extracts insights from the content of job postings, not just hiring velocity:
View Schema Example
{
"signal_type": "job-posting-content",
"signal_subtype": "tech_stack_signal",
"association": "company",
"company": {
"name": "Acme Corp",
"domain": "acme.com"
},
"data": {
"summary": "Hiring for Snowflake and dbt expertise across 5 open roles",
"technologies_mentioned": ["Snowflake", "dbt", "Airflow", "Python"],
"tools_mentioned": ["Salesforce", "HubSpot", "Tableau"],
"pain_points_mentioned": ["scaling data infrastructure", "real-time analytics"],
"departments": ["engineering", "data"],
"seniority_levels": ["senior", "staff"],
"role_count": 5,
"compensation_range": {
"low": 150000,
"high": 200000,
"currency": "USD"
}
}
}Available subtypes:
| Subtype | Description |
|---|---|
tech_stack_signal | Technologies and tools mentioned in job descriptions |
pain_point_signal | Challenges and problems mentioned ("scaling our infrastructure") |
budget_signal | Compensation ranges indicating budget levels |
team_buildout_signal | Evidence of building new teams or functions |
Why it matters: If a company is hiring for "Snowflake engineers," that's a technographic signal. If they mention "scaling challenges," that's a pain point.
🎤 Conference & Event Signals
Track company participation in industry events:
View Schema Example
{
"signal_type": "event",
"signal_subtype": "conference_sponsoring",
"association": "company",
"company": {
"name": "Acme Corp",
"domain": "acme.com"
},
"data": {
"summary": "Acme Corp is a Gold sponsor at SaaStr Annual 2026",
"event_name": "SaaStr Annual 2026",
"event_date": "2026-09-15",
"event_location": "San Francisco, CA",
"participation_type": "sponsor",
"sponsorship_level": "Gold",
"source_url": "https://saastrannual.com/sponsors"
}
}Available subtypes:
| Subtype | Description |
|---|---|
conference_attending | Company registered/attending an event |
conference_sponsoring | Company sponsoring an event |
conference_speaking | Executive speaking at event |
webinar_hosting | Company hosting a webinar |
👔 Executive Movement Graph
Beyond individual job changes, track the executive network:
View Schema Example
{
"company_domain": "acme.com",
"executive_connections": {
"alumni_sources": [
{
"company_domain": "google.com",
"executive_count": 3,
"titles": ["CTO", "VP Engineering", "Director of Product"]
}
],
"alumni_destinations": [
{
"company_domain": "startup.io",
"executive_count": 2,
"titles": ["CEO", "COO"]
}
],
"board_overlap": [
{
"company_domain": "portfolio-co.com",
"shared_board_members": ["Jane Smith"]
}
]
}
}Enables:
- "Their new CTO came from your customer Google" — warm intro signal
- Track where departing executives go
- Identify companies with shared board members or investors
📜 Regulatory & Compliance Signals
New subtypes for SEC filings focused on compliance and regulatory mentions:
| Subtype | Description |
|---|---|
gdpr_compliance | GDPR compliance discussed or achieved |
soc2_certification | SOC2 mentioned or certification achieved |
hipaa_compliance | Healthcare compliance signals |
fedramp_pursuit | Government compliance pursuit |
ai_governance | AI policy or governance framework mentioned |
data_privacy_initiative | Data privacy investments or initiatives |
Why it matters: Compliance is a major budget driver. "Company mentioned SOC2 in their 10-K" signals security spending.
🔌 API-First Signal Access
Real-time signal lookup via REST API:
curl -X GET "https://api.autobound.ai/signals/v1/company/stripe.com" \
-H "X-API-KEY: your-key" \
-d "signal_types=hiring-velocity,news,earnings" \
-d "max_age_days=30"Features:
- Fetch signals for a specific company or contact
- Filter by signal type, subtype, age
- Webhook subscriptions for new signals
This complements bulk delivery for point-of-action use cases.
🔄 Cross-Signal Deduplication
Improved deduplication across signal sources:
Deduplicated signals will include a related_signals array linking to other sources.
Platform Features (H2 2026)
Canonical Entity IDs Schema
{
"company": {
"autobound_id": "ab_company_12345",
"domain": "stripe.com",
"linkedin_url": "linkedin.com/company/stripe"
}
}Enables:
- Consistent joins across signal types
- Historical tracking of entity changes
- Simplified data modeling
Schema Standardization Summary
Fields Being Standardized
| Field | Current State | Target State |
|---|---|---|
relevance | 0–1 or 0–100 depending on signal | 0.0–1.0 everywhere |
confidence | String or float | Both: confidence (string) + confidence_score (float) |
sentiment | Present on some signals | Present on all signals |
competitors_mentioned | Names only | Names + domains |
technologies_mentioned | Inconsistent | Standardized taxonomy |
source_url | Various field names | source_url everywhere |
New Standard Fields (All Signals)
| Field | Type | Description |
|---|---|---|
moderation_score | float | Content safety (0.0–1.0, higher = safer) |
freshness_hours | integer | Hours since underlying event occurred |
evidence_count | integer | Number of sources supporting this signal |
Subtype Consolidation & Expansion
We're both consolidating fragmented subtypes and launching many new ones.
Consolidation
Current State: 400+ subtypes across 24 categories with inconsistencies:
- Similar subtypes with different names (
ceoChangevsleadershipChangevshires) - Legacy subtypes (
talentRetentionConcernsvsglassdoorTalentRetentionConcerns) - Inconsistent casing (
hiringVelocityvshiring_velocity)
Target State: Consistent naming convention → category_specificEvent
Examples: leadership_ceo_change · funding_series_a · acquisition_completed
Legacy subtypes will be supported via aliases for backward compatibility.
New Subtypes Coming
Earnings Transcripts
| New Subtype | Description |
|---|---|
guidance_beat | Company beat earnings guidance |
guidance_miss | Company missed earnings guidance |
margin_expansion | Profit margins expanding |
margin_compression | Profit margins compressing |
10-K / 10-Q Filings
| New Subtype | Description |
|---|---|
esg_commitment | ESG or sustainability commitment |
supply_chain_risk | Supply chain risk disclosed |
pricing_power | Pricing power mentioned |
customer_concentration_risk | Revenue concentration risk |
News
| New Subtype | Description |
|---|---|
executive_speaking | Executive speaking engagement |
conference_presentation | Company presenting at conference |
analyst_upgrade | Analyst upgraded stock rating |
analyst_downgrade | Analyst downgraded stock rating |
Hiring
| New Subtype | Description |
|---|---|
executive_search | Executive-level search underway |
team_buildout | Building new team/function |
replacement_hire | Backfilling departed employee |
| New Subtype | Description |
|---|---|
thought_leadership | Thought leadership content |
product_announcement | Product/feature announcement |
hiring_post | "We're hiring" post |
event_promotion | Event or webinar promotion |
| New Subtype | Description |
|---|---|
vendor_evaluation | Actively evaluating vendors |
implementation_question | Implementation help request |
feature_request | Feature request or wishlist |
migration_discussion | Discussing switching providers |
New subtypes will be documented in the Signal Catalog as they launch.
Stay Updated
Track all schema changes, new signals, and API updates in our Changelog.
Questions?
Updated about 9 hours ago
