New Business Formation
Daily Secretary of State filings tracking new business incorporations—LLCs, corporations, LPs, and DBA registrations—across all 50 US states.
New Business Formation signals surface newly registered legal entities within hours of official state recording—giving go-to-market teams access to net-new businesses before they appear in any commercial database.
Autobound operates a proprietary ingestion and normalization infrastructure spanning all 50 US states. Each state exposes entity registration data through a distinct schema, format, and delivery mechanism—with significant variation in field naming, filing taxonomy, address structure, and update cadence. Our pipeline abstracts this complexity into a single, consistent schema, resolving cross-state inconsistencies in real time and enriching records with domain and contact data where available.
Each signal includes the legal entity name, entity type, principal address, registered agent, filing date, and jurisdiction. Records are enriched with domain resolution (SERP + LLM), officer LinkedIn profiles (SERP + verification), officer email, company LinkedIn, firmographic data (industry, employee count, description), and NAICS classification.
The result: you know the moment a new business is registered in your territory—before they've been cold-called, enriched, or added to any competitor's database. This is the earliest possible buying signal for SMB-focused sellers, franchise developers, insurance brokers, payroll providers, attorneys, accountants, and any GTM team targeting net-new businesses.
Priority states: New York, Colorado, Oregon, Pennsylvania, Connecticut, and Iowa are available at launch. Full 50-state coverage is on the roadmap — contact [email protected] for current state availability.
| Subtype Enum | Category | Description |
|---|---|---|
| newBusinessFormation | formation | New business entity registered with state SOS |
The entity_type field in the data object (e.g. LLC, CORP, LP) provides granular filtering within this subtype.
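As an illustration of filtering within the subtype, the following is a minimal sketch that assumes each signal has already been parsed into a Python dict shaped like the example signal on this page. The helper name `filter_by_entity_type` is hypothetical and not part of any Autobound SDK.

```python
from typing import Iterable, Iterator


def filter_by_entity_type(signals: Iterable[dict], wanted: set[str]) -> Iterator[dict]:
    """Yield only the signals whose data.entity_type is in `wanted`."""
    for signal in signals:
        # Tolerate a missing or null "data" object.
        entity_type = (signal.get("data") or {}).get("entity_type")
        if entity_type in wanted:
            yield signal


# Illustrative records only; real signals carry many more fields.
signals = [
    {"signal_subtype": "newBusinessFormation", "data": {"entity_type": "LLC"}},
    {"signal_subtype": "newBusinessFormation", "data": {"entity_type": "CORP_FOREIGN"}},
    {"signal_subtype": "newBusinessFormation", "data": {"entity_type": "LP"}},
]

llc_and_lp = list(filter_by_entity_type(signals, {"LLC", "LP"}))
print([s["data"]["entity_type"] for s in llc_and_lp])
```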
Example Signal
{
  "signal_id": "9944d8d8-e159-4ac5-98dc-34c765889d61",
  "signal_type": "new-business-formation",
  "signal_subtype": "newBusinessFormation",
  "detected_at": "2026-03-01T00:00:00Z",
  "association": "company",
  "company": {
    "name": "Zellerfeld Shoe Company Inc.",
    "domain": "zellerfeld.com",
    "linkedin_url": "linkedin.com/company/zellerfeld-inc/",
    "industries": ["Retail Apparel And Fashion"],
    "employee_count_low": 26,
    "employee_count_high": 50,
    "description": "Zellerfeld is the only technology company that produces a whole 3D-printed shoe with the push of a button..."
  },
  "data": {
    "summary": "Zellerfeld Shoe Company Inc. filed as CORP_FOREIGN in CO on 2026-03-01.",
    "entity_type": "CORP_FOREIGN",
    "filing_number": "20261255801",
    "filing_date": "2026-03-01",
    "state_source": "CO",
    "jurisdiction": "DE",
    "city": "Austin",
    "zip_code": "78753-1398",
    "address": "905 West Howard Lane",
    "county": null,
    "status": "Good Standing",
    "registered_agent": null,
    "naics_code": null,
    "registered_email": null,
    "is_likely_shell": false,
    "match_confidence": "very_high",
    "officers": [
      {
        "name": "Cullan Kerner",
        "title": "Creative Community Lead",
        "linkedin_url": "linkedin.com/in/cullan-kerner-25368812a",
        "match_confidence": "medium",
        "email": "[email protected]"
      }
    ],
    "enrichment_calls": {
      "serp": 2,
      "daxa": 1,
      "llm": 1
    }
  }
}
Note on company.domain: Brand-new entities typically don't have a website at time of filing. When Autobound can resolve a domain through enrichment, it will be populated. Expect null for ~65% of fresh formations.
Note on match_confidence: This is the domain resolution confidence tier based on SERP + LLM scoring: very_high (score 9-10), high (7-8), medium (5-6), low (3-4), very_low (1-2).
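The tier ranges in the note above can be written down as a lookup table. The sketch below is illustrative only: the payload delivers the tier string, not the numeric score, and the helper names `tier_for_score` and `domain_is_trustworthy` (and the default threshold of `high`) are assumptions made for this example.

```python
# Score ranges per tier, taken from the note on match_confidence.
TIER_SCORE_RANGES = {
    "very_high": (9, 10),
    "high": (7, 8),
    "medium": (5, 6),
    "low": (3, 4),
    "very_low": (1, 2),
}


def tier_for_score(score: int) -> str:
    """Map a 1-10 score to its confidence tier."""
    for tier, (low, high) in TIER_SCORE_RANGES.items():
        if low <= score <= high:
            return tier
    raise ValueError(f"score outside 1-10: {score}")


def domain_is_trustworthy(signal: dict, min_tier: str = "high") -> bool:
    """True when company.domain is present and its tier is at least `min_tier`."""
    domain = (signal.get("company") or {}).get("domain")
    tier = (signal.get("data") or {}).get("match_confidence")
    if domain is None or tier not in TIER_SCORE_RANGES:
        return False  # null domain (common for fresh formations) or unknown tier
    # Compare tiers by the lower bound of their score range.
    return TIER_SCORE_RANGES[tier][0] >= TIER_SCORE_RANGES[min_tier][0]


signal = {"company": {"domain": "zellerfeld.com"}, "data": {"match_confidence": "very_high"}}
print(tier_for_score(6), domain_is_trustworthy(signal))
```

Because most fresh formations arrive with a null domain, a consumer would likely want to treat the null case as a normal branch rather than an error, as the sketch does.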
Signal Fields
| Field | Type | Required | Description |
|---|---|---|---|
| signal_id | string (UUID) | ✓ | Unique identifier for this signal |
| signal_type | string | ✓ | Always "new-business-formation" |
| signal_subtype | string | ✓ | Always "newBusinessFormation" |
| detected_at | string (ISO 8601) | ✓ | Timestamp when Autobound ingested this filing |
| association | string | ✓ | Always "company" |
Company Object
| Field | Type | Required | Description |
|---|---|---|---|
| company.name | string | ✓ | Legal entity name as filed |
| company.domain | string | | Resolved website domain. Often null for brand-new entities at time of filing. |
| company.linkedin_url | string | | Company LinkedIn URL (format: linkedin.com/company/{slug}) |
| company.industries | array[string] | | Industry classifications from firmographic enrichment |
| company.employee_count_low | integer | | Lower bound of employee count range |
| company.employee_count_high | integer | | Upper bound of employee count range |
| company.description | string | | Company description from firmographic enrichment |
Data Object
| Field | Type | Required | Description |
|---|---|---|---|
| data.summary | string | ✓ | Human-readable one-line description of the filing event |
| data.entity_type | string | ✓ | Normalized legal structure: LLC, CORP, CORP_FOREIGN, LP, LLP, NONPROFIT |
| data.filing_number | string | ✓ | State-assigned filing or entity number |
| data.filing_date | string | ✓ | Date the filing was recorded with the state SOS (ISO 8601 date) |
| data.state_source | string | ✓ | US state where the entity was filed (2-letter abbreviation) |
| data.jurisdiction | string | ✓ | State/jurisdiction of formation (may differ from state_source for foreign entities) |
| data.city | string | | City from principal address |
| data.zip_code | string | | ZIP / postal code |
| data.address | string | | Principal office street address |
| data.county | string | | County of formation (available in NY, PA) |
| data.status | string | | Entity status: Good Standing, Active, etc. |
| data.registered_agent | string | | Name of the registered agent on file |
| data.naics_code | string | | NAICS industry code (available in CT) |
| data.registered_email | string | | Entity-level email from state registration (available in CT) |
| data.is_likely_shell | boolean | ✓ | Rule-based prefilter flagging likely shell/holding entities |
| data.match_confidence | string | | Domain resolution confidence tier: very_high, high, medium, low, very_low |
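
A consumer may want to check the fields marked ✓ in the tables above before loading a record. The following is a minimal sketch under that assumption. The helper names are hypothetical, and treating `state_source != jurisdiction` as a sign of a foreign filing is a heuristic inferred from the field description, not a documented rule.

```python
# Dotted paths of the fields marked as required in the field tables.
REQUIRED_PATHS = [
    "signal_id", "signal_type", "signal_subtype", "detected_at", "association",
    "company.name",
    "data.summary", "data.entity_type", "data.filing_number", "data.filing_date",
    "data.state_source", "data.jurisdiction", "data.is_likely_shell",
]


def get_path(record: dict, dotted: str):
    """Follow a dotted path through nested dicts; return None if any step is missing."""
    current = record
    for key in dotted.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def missing_required(signal: dict) -> list[str]:
    """List required paths that are absent or null (False counts as present)."""
    return [path for path in REQUIRED_PATHS if get_path(signal, path) is None]


def is_foreign_filing(signal: dict) -> bool:
    """Heuristic: the entity was formed in a different jurisdiction than where it filed."""
    data = signal.get("data") or {}
    return data.get("state_source") != data.get("jurisdiction")


# Abbreviated copy of the example signal on this page.
example = {
    "signal_id": "9944d8d8-e159-4ac5-98dc-34c765889d61",
    "signal_type": "new-business-formation",
    "signal_subtype": "newBusinessFormation",
    "detected_at": "2026-03-01T00:00:00Z",
    "association": "company",
    "company": {"name": "Zellerfeld Shoe Company Inc."},
    "data": {
        "summary": "Zellerfeld Shoe Company Inc. filed as CORP_FOREIGN in CO on 2026-03-01.",
        "entity_type": "CORP_FOREIGN",
        "filing_number": "20261255801",
        "filing_date": "2026-03-01",
        "state_source": "CO",
        "jurisdiction": "DE",
        "is_likely_shell": False,
    },
}

print(missing_required(example), is_foreign_filing(example))
```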
Officers Array
The data.officers array contains LinkedIn-verified officers/owners associated with the entity. May contain multiple entries (up to 3). Null when no officers could be resolved.
| Field | Type | Description |
|---|---|---|
| data.officers[].name | string | Officer/owner full name |
| data.officers[].title | string | Title or role at the entity |
| data.officers[].linkedin_url | string | LinkedIn profile URL (format: linkedin.com/in/{slug}) |
| data.officers[].match_confidence | string | Officer-to-entity match confidence: high or medium |
| data.officers[].email | string | Business email address (when available from contact database) |
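
Because `data.officers` is null when no officers could be resolved, code that reads it needs to handle both the null and the list case. A minimal sketch, assuming signals parsed into dicts; the helper name `officer_contacts` and the placeholder email address are illustrative only.

```python
# Documented officer confidence values, ordered from weaker to stronger.
OFFICER_CONFIDENCE_RANK = {"medium": 0, "high": 1}


def officer_contacts(signal: dict, min_confidence: str = "medium") -> list[dict]:
    """Return officers at or above `min_confidence`; null officers becomes []."""
    officers = (signal.get("data") or {}).get("officers") or []
    threshold = OFFICER_CONFIDENCE_RANK[min_confidence]
    return [
        officer
        for officer in officers
        # Unknown or missing confidence values are excluded.
        if OFFICER_CONFIDENCE_RANK.get(officer.get("match_confidence"), -1) >= threshold
    ]


with_officer = {
    "data": {
        "officers": [
            {
                "name": "Cullan Kerner",
                "title": "Creative Community Lead",
                "linkedin_url": "linkedin.com/in/cullan-kerner-25368812a",
                "match_confidence": "medium",
                "email": "officer@example.com",  # placeholder, not a real address
            }
        ]
    }
}
no_officers = {"data": {"officers": None}}

print(len(officer_contacts(with_officer)), len(officer_contacts(no_officers)))
```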
Enrichment Calls
The data.enrichment_calls object provides transparency into API calls made during enrichment. Useful for debugging and cost tracking.
| Field | Type | Description |
|---|---|---|
| data.enrichment_calls.serp | integer | Number of SERP API calls (domain + officer lookups) |
| data.enrichment_calls.daxa | integer | Number of Daxa profile verification calls |
| data.enrichment_calls.llm | integer | Number of LLM calls (domain extraction) |
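
For cost tracking, the per-signal counters can be summed across a batch. This is a rough sketch that assumes signals are already parsed into dicts; `total_enrichment_calls` is a hypothetical helper name, and no per-call pricing is assumed.

```python
from collections import Counter
from typing import Iterable


def total_enrichment_calls(signals: Iterable[dict]) -> Counter:
    """Sum data.enrichment_calls counters (serp, daxa, llm) over a batch."""
    totals: Counter = Counter()
    for signal in signals:
        calls = (signal.get("data") or {}).get("enrichment_calls") or {}
        # Skip any non-integer values defensively.
        totals.update({k: v for k, v in calls.items() if isinstance(v, int)})
    return totals


batch = [
    {"data": {"enrichment_calls": {"serp": 2, "daxa": 1, "llm": 1}}},
    {"data": {"enrichment_calls": {"serp": 1}}},
    {"data": {}},  # no enrichment_calls object
]

totals = total_enrichment_calls(batch)
print(dict(totals))
```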
Delivery
- Refresh: Daily
- Format: JSONL + Parquet (Snappy compression)
- Bucket structure: gs://autobound-new-business-formation/{YYYY-MM-DD-HH-MM-SS}/output.jsonl and output.parquet
- Priority states: NY, CO, OR, PA, CT, IA (daily); remaining states available: contact [email protected]
- Coverage: All 50 US states on roadmap
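
The delivery layout described above can be handled with the standard library alone. The sketch below assumes the run-folder names have already been listed by whatever GCS client is in use (the listing call itself is omitted), and that the folder timestamp follows the `YYYY-MM-DD-HH-MM-SS` pattern exactly. The folder names in the example are made up, and the helper names are hypothetical.

```python
import json
from datetime import datetime
from typing import Iterable, Iterator

BUCKET = "gs://autobound-new-business-formation"


def parse_run_folder(folder: str) -> datetime:
    """Parse a run folder name of the form YYYY-MM-DD-HH-MM-SS."""
    return datetime.strptime(folder, "%Y-%m-%d-%H-%M-%S")


def latest_run_uri(folders: list[str], filename: str = "output.jsonl") -> str:
    """Build the URI of `filename` inside the most recent run folder."""
    latest = max(folders, key=parse_run_folder)
    return f"{BUCKET}/{latest}/{filename}"


def iter_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Parse JSONL content line by line, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


# Hypothetical folder names, for illustration only.
folders = ["2026-03-01-00-00-00", "2026-03-02-06-30-00"]
print(latest_run_uri(folders))

records = list(iter_jsonl(['{"signal_subtype": "newBusinessFormation"}\n', "\n"]))
print(len(records))
```

`iter_jsonl` accepts any iterable of lines, so it works equally on an open local file or on downloaded content split by line.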
Fill Rates (Current Coverage)
| Field | Overall | Notes |
|---|---|---|
| company.domain | ~34% | Higher for established entities filing in new states |
| company.linkedin_url | ~9% | From enrichment + firmographic match |
| company.industries | ~6% | Where firmographic data available |
| company.employee_count | ~8% | Where firmographic data available |
| data.officers | ~7% | LinkedIn-verified, up to 3 per entity |
| officers[].email | ~1.5% | From contact database match on LinkedIn URL |
| data.registered_email | ~8% | CT only (provided on filing) |
| data.naics_code | ~8% | CT only (provided on filing) |
| data.county | ~48% | NY + PA provide county |
| data.registered_agent | ~28% | IA, OR, CO provide agent name |
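
Fill rates like those in the table can be re-measured on any delivered batch. This is a minimal sketch, assuming a fill rate is the share of records where a field is present and neither null nor empty; how Autobound computes the published figures is not stated here, so results may differ. The helper names are hypothetical.

```python
def get_path(record: dict, dotted: str):
    """Follow a dotted path through nested dicts; return None if any step is missing."""
    current = record
    for key in dotted.split("."):
        if not isinstance(current, dict):
            return None
        current = current.get(key)
    return current


def fill_rate(signals: list[dict], dotted: str) -> float:
    """Share of signals where the field is non-null and non-empty."""
    if not signals:
        return 0.0
    filled = sum(1 for s in signals if get_path(s, dotted) not in (None, [], ""))
    return filled / len(signals)


# Illustrative batch of four records; values are not real data.
batch = [
    {"company": {"domain": "zellerfeld.com"}, "data": {"officers": [{"name": "Cullan Kerner"}]}},
    {"company": {"domain": None}, "data": {"officers": None}},
    {"company": {"domain": None}, "data": {"officers": []}},
    {"company": {}, "data": {}},
]

print(fill_rate(batch, "company.domain"), fill_rate(batch, "data.officers"))
```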
