New Business Formation

Daily Secretary of State filings tracking new business incorporations—LLCs, corporations, LPs, and DBA registrations—across all 50 US states.

New Business Formation signals surface newly registered legal entities within hours of official state recording—giving go-to-market teams access to net-new businesses before they appear in any commercial database.

Autobound operates a proprietary ingestion and normalization infrastructure spanning all 50 US states. Each state exposes entity registration data through a distinct schema, format, and delivery mechanism—with significant variation in field naming, filing taxonomy, address structure, and update cadence. Our pipeline abstracts this complexity into a single, consistent schema, resolving cross-state inconsistencies in real time and enriching records with domain and contact data where available.

Each signal includes the legal entity name, entity type, principal address, registered agent, filing date, and jurisdiction. Records are enriched with domain resolution (SERP + LLM), officer LinkedIn profiles (SERP + verification), officer email, company LinkedIn, firmographic data (industry, employee count, description), and NAICS classification.

The result: you know the moment a new business is registered in your territory—before they've been cold-called, enriched, or added to any competitor's database. This is the earliest possible buying signal for SMB-focused sellers, franchise developers, insurance brokers, payroll providers, attorneys, accountants, and any GTM team targeting net-new businesses.

Priority states: New York, Colorado, Oregon, Pennsylvania, Connecticut, and Iowa are available at launch. Full 50-state coverage is on the roadmap — contact [email protected] for current state availability.

Subtype EnumCategoryDescription
newBusinessFormationformationNew business entity registered with state SOS

The entity_type field in the data object (e.g. LLC, CORP, LP) provides granular filtering within this subtype.

Example Signal

{
  "signal_id": "9944d8d8-e159-4ac5-98dc-34c765889d61",
  "signal_type": "new-business-formation",
  "signal_subtype": "newBusinessFormation",
  "detected_at": "2026-03-01T00:00:00Z",
  "association": "company",

  "company": {
    "name": "Zellerfeld Shoe Company Inc.",
    "domain": "zellerfeld.com",
    "linkedin_url": "linkedin.com/company/zellerfeld-inc/",
    "industries": ["Retail Apparel And Fashion"],
    "employee_count_low": 26,
    "employee_count_high": 50,
    "description": "Zellerfeld is the only technology company that produces a whole 3D-printed shoe with the push of a button..."
  },

  "data": {
    "summary": "Zellerfeld Shoe Company Inc. filed as CORP_FOREIGN in CO on 2026-03-01.",
    "entity_type": "CORP_FOREIGN",
    "filing_number": "20261255801",
    "filing_date": "2026-03-01",
    "state_source": "CO",
    "jurisdiction": "DE",
    "city": "Austin",
    "zip_code": "78753-1398",
    "address": "905 West Howard Lane",
    "county": null,
    "status": "Good Standing",
    "registered_agent": null,
    "naics_code": null,
    "registered_email": null,
    "is_likely_shell": false,
    "match_confidence": "very_high",
    "officers": [
      {
        "name": "Cullan Kerner",
        "title": "Creative Community Lead",
        "linkedin_url": "linkedin.com/in/cullan-kerner-25368812a",
        "match_confidence": "medium",
        "email": "[email protected]"
      }
    ],
    "enrichment_calls": {
      "serp": 2,
      "daxa": 1,
      "llm": 1
    }
  }
}

Note on company.domain: Brand-new entities typically don't have a website at time of filing. When Autobound can resolve a domain through enrichment, it will be populated. Expect null for ~65% of fresh formations.

Note on match_confidence: This is the domain resolution confidence tier based on SERP + LLM scoring. very_high (score 9-10), high (7-8), medium (5-6), low (3-4), very_low (1-2).


Signal Fields

FieldTypeRequiredDescription
signal_idstring (UUID)Unique identifier for this signal
signal_typestringAlways "new-business-formation"
signal_subtypestringAlways "newBusinessFormation"
detected_atstring (ISO 8601)Timestamp when Autobound ingested this filing
associationstringAlways "company"

Company Object

FieldTypeRequiredDescription
company.namestringLegal entity name as filed
company.domainstringResolved website domain. Often null for brand-new entities at time of filing.
company.linkedin_urlstringCompany LinkedIn URL (format: linkedin.com/company/{slug})
company.industriesarray[string]Industry classifications from firmographic enrichment
company.employee_count_lowintegerLower bound of employee count range
company.employee_count_highintegerUpper bound of employee count range
company.descriptionstringCompany description from firmographic enrichment

Data Object

FieldTypeRequiredDescription
data.summarystringHuman-readable one-line description of the filing event
data.entity_typestringNormalized legal structure: LLC, CORP, CORP_FOREIGN, LP, LLP, NONPROFIT
data.filing_numberstringState-assigned filing or entity number
data.filing_datestringDate the filing was recorded with the state SOS (ISO 8601 date)
data.state_sourcestringUS state where the entity was filed (2-letter abbreviation)
data.jurisdictionstringState/jurisdiction of formation (may differ from state_source for foreign entities)
data.citystringCity from principal address
data.zip_codestringZIP / postal code
data.addressstringPrincipal office street address
data.countystringCounty of formation (available in NY, PA)
data.statusstringEntity status: Good Standing, Active, etc.
data.registered_agentstringName of the registered agent on file
data.naics_codestringNAICS industry code (available in CT)
data.registered_emailstringEntity-level email from state registration (available in CT)
data.is_likely_shellbooleanRule-based prefilter flagging likely shell/holding entities
data.match_confidencestringDomain resolution confidence tier: very_high, high, medium, low, very_low

Officers Array

The data.officers array contains LinkedIn-verified officers/owners associated with the entity. May contain multiple entries (up to 3). Null when no officers could be resolved.

FieldTypeDescription
data.officers[].namestringOfficer/owner full name
data.officers[].titlestringTitle or role at the entity
data.officers[].linkedin_urlstringLinkedIn profile URL (format: linkedin.com/in/{slug})
data.officers[].match_confidencestringOfficer-to-entity match confidence: high or medium
data.officers[].emailstringBusiness email address (when available from contact database)

Enrichment Calls

The data.enrichment_calls object provides transparency into API calls made during enrichment. Useful for debugging and cost tracking.

FieldTypeDescription
data.enrichment_calls.serpintegerNumber of SERP API calls (domain + officer lookups)
data.enrichment_calls.daxaintegerNumber of Daxa profile verification calls
data.enrichment_calls.llmintegerNumber of LLM calls (domain extraction)

Delivery

  • Refresh: Daily
  • Format: JSONL + Parquet (Snappy compression)
  • Bucket structure: gs://autobound-new-business-formation/{YYYY-MM-DD-HH-MM-SS}/output.jsonl and output.parquet
  • Priority states: NY, CO, OR, PA, CT, IA (daily); remaining states available — contact [email protected]
  • Coverage: All 50 US states on roadmap

Fill Rates (Current Coverage)

FieldOverallNotes
company.domain~34%Higher for established entities filing in new states
company.linkedin_url~9%From enrichment + firmographic match
company.industries~6%Where firmographic data available
company.employee_count~8%Where firmographic data available
data.officers~7%LinkedIn-verified, up to 3 per entity
officers[].email~1.5%From contact database match on LinkedIn URL
data.registered_email~8%CT only (provided on filing)
data.naics_code~8%CT only (provided on filing)
data.county~48%NY + PA provide county
data.registered_agent~28%IA, OR, CO provide agent name