Delivery URIs

How to access the Autobound Signal Database via Google Cloud Storage.

We provision read access to Google Cloud Storage (GCS) buckets. You authenticate with a service account we provide and pull data on your own schedule.

Delivery Model

Each delivery is a delta. A timestamped folder contains only the signals detected since the previous delivery — not a cumulative snapshot. To build historical coverage, union folders backwards.

Deliveries are immutable. Once a folder lands in a bucket, it's never modified. If data needs to be re-delivered for a past period, it arrives as a new timestamped folder.

Bucket versions (v1, v2, v3) are schema evolutions. The underlying signal data is the same — field names, types, or structure changed between versions. Looking far enough back may require reading from an older bucket version with a different schema. We're standardizing schemas across all signal types in 2026.

Backfill folders are historical deltas. Delivered at onboarding to provide coverage for time periods before your contract started.

Available Buckets

Each signal type has its own dedicated bucket. Access each bucket directly by its URI.

SEC Filings

BucketDescription
gs://autobound-10k-v1/SEC 10-K annual filings
gs://autobound-10q-v1/SEC 10-Q quarterly filings
gs://autobound-20f-v2/SEC 20-F foreign company filings
gs://autobound-6k-v2/SEC 6-K foreign company reports
gs://autobound-8k/SEC 8-K current reports
gs://autobound-earnings-transcripts-v2/Earnings call transcripts

Social & Web Signals

BucketDescription
gs://autobound-linkedin-post-company-v2/LinkedIn posts (company-level)
gs://autobound-linkedin-post-contact-v3/LinkedIn posts (contact-level)
gs://autobound-linkedin-comments-contact-v2/LinkedIn comments (contact-level)
gs://autobound-glassdoor-company-v2/Glassdoor company reviews
gs://autobound-reddit-company-v2/Reddit mentions (company-level)
gs://autobound-twitter-company-posts-v2/Twitter/X posts (company-level)
gs://autobound-twitter-contact-posts-v2/Twitter/X posts (contact-level)
gs://autobound-youtube-company-v2/YouTube activity (company-level)
gs://autobound-youtube-contact-v2/YouTube activity (contact-level)

New Signals (Q1 2026)

BucketDescription
gs://autobound-sec-form-d-funding/SEC Form D funding filings
gs://autobound-new-business-formation/New business formations
gs://autobound-federal-contract-award/Federal contract awards
gs://autobound-conference/Conference speakers
gs://autobound-hackernews/HackerNews signals
gs://autobound-producthunt/ProductHunt launches
gs://autobound-podcast-company/Podcast appearances (company-level)
gs://autobound-podcast-contact/Podcast appearances (contact-level)

Company Intelligence

BucketDescription
gs://autobound-news-v4/News signals
gs://autobound-hiring-trends/Hiring trends
gs://autobound-hiring-velocity-v1/Hiring velocity
gs://autobound-employee-growth-v2/Employee growth signals
gs://autobound-github-v1/GitHub activity
gs://autobound-product-reviews-v2/Product reviews (G2)
gs://autobound-patents-v2/Patent filings (v2)
gs://autobound-seo-traffic-v2/SEO & traffic signals
gs://autobound-website-intelligence-v1/Website intelligence
gs://autobound-work-milestones-v2/Work milestones
gs://autobound-financials/Financial data
gs://autobound-tech-used/Technology stack
gs://autobound-intent/Intent signals

Reference Data

BucketDescription
gs://autobound-company-database/Company database
gs://autobound-contact-database/Contact database
gs://autobound-manifests/Data manifests

Bucket Structure

Each bucket contains timestamped folders. Each folder has two files: output.jsonl and output.parquet.

gs://autobound-news-v4/
├── 2026-01-31-17-30-00/
│   ├── output.jsonl
│   └── output.parquet
├── 2025-12-31-17-30-00/
│   ├── output.jsonl
│   └── output.parquet
└── ...

Pull from the most recent folder to get the latest data.

Authentication

  1. We provide you with a GCP service account JSON key file
  2. Set the environment variable: export GOOGLE_APPLICATION_CREDENTIALS="/path/to/key.json"
  3. Use gsutil to access the buckets
Example: Listing and downloading files
# List folders in a bucket (access each bucket directly by URI)
gsutil ls gs://autobound-news-v4/

# Download JSONL
gsutil cp gs://autobound-news-v4/2026-01-31-17-30-00/output.jsonl ./

# Download Parquet
gsutil cp gs://autobound-news-v4/2026-01-31-17-30-00/output.parquet ./

Tip: You can discover all available signal buckets by running:

gsutil ls -p autobound-signal-delivery

This returns the names of all signal buckets in the project - including buckets you are not licensed for. You can browse bucket names and folder timestamps freely, but accessing the data inside a bucket (reading files) requires objectViewer on that specific bucket, which is granted during onboarding. Unlicensed buckets will return AccessDeniedException: 403 when you attempt to read their contents.


File Formats

Both formats contain the same data. Choose based on your pipeline.

FormatFileBest For
JSONLoutput.jsonlStreaming ingestion, debugging, simple parsing
Parquetoutput.parquetData warehouses, analytics, large-scale processing
JSONL example

One signal per line:

{"signal_id":"7dfdb4b4-c0b4-4620-aca6-e7263123028e","signal_type":"news","detected_at":"2026-01-15T10:30:00Z","association":"company","company":{"name":"Acme Corp","domain":"acme.com"},"data":{"summary":"Acme Corp announces expansion into European markets...","source_url":"https://example.com/news/acme-expansion"}}
Parquet schema
signal_id: STRING
signal_type: STRING
signal_subtype: STRING
detected_at: TIMESTAMP
association: STRING
company: STRUCT<name, domain, linkedin_url, industries, employee_count_low, employee_count_high, description>
contact: STRUCT<first_name, last_name, name, email, job_title, linkedin_url, city, state, country>
data: STRING (JSON-encoded)

The data field is JSON-encoded to support varying fields across signal types.


Refresh Cadence

📘

June 2026 Update: News signals transitioning from weekly to daily in early June 2026. During transition, weekly and daily deliveries run in parallel.

CategoryFrequencyLast DeliveredNext Delivery
SEC 10-K Annual FilingsWeeklyJun 23, 2026Jun 30, 2026
SEC 10-Q Quarterly FilingsWeeklyJun 23, 2026Jun 30, 2026
SEC 8-K Current ReportsWeeklyJun 23, 2026Jun 30, 2026
SEC 6-K Foreign ReportsWeeklyJun 23, 2026Jun 30, 2026
SEC 20-F Foreign FilingsWeeklyJun 2, 2026Jun 16, 2026
Earnings TranscriptsWeeklyJun 23, 2026Jun 30, 2026
Financial FundamentalsWeeklyJun 23, 2026Jun 30, 2026
NewsDailyJun 23, 2026Jun 24, 2026
Hiring TrendsWeeklyJun 23, 2026Jun 30, 2026
Hiring VelocityWeeklyJun 23, 2026Jun 30, 2026
Work MilestonesWeeklyJun 23, 2026Jun 30, 2026
LinkedIn Posts (Company)MonthlyJun 12, 2026Jul 12, 2026
LinkedIn Posts (Contact)Bi-weeklyJun 22, 2026Jul 6, 2026
LinkedIn Comments (Contact)MonthlyJun 12, 2026Jul 12, 2026
Glassdoor ReviewsMonthlyJun 12, 2026Jul 12, 2026
Reddit Mentions (Company)MonthlyMay 25, 2026Jun 24, 2026
Twitter/X Posts (Company)MonthlyJun 12, 2026Jul 12, 2026
Twitter/X Posts (Contact)MonthlyJun 12, 2026Jul 12, 2026
YouTube Activity (Company)MonthlyJun 12, 2026Jul 12, 2026
YouTube Activity (Contact)MonthlyJun 12, 2026Jul 12, 2026
GitHub ActivityMonthlyJun 12, 2026Jul 12, 2026
Product Reviews (G2)MonthlyJun 12, 2026Jul 12, 2026
SEO & TrafficMonthlyJun 23, 2026Jul 23, 2026
Patent FilingsMonthlyJun 2, 2026Jul 1, 2026
Website IntelligenceMonthlyJun 2, 2026Jul 1, 2026
Employee GrowthQuarterlyApr 15, 2026Jul 15, 2026
Conference SpeakersMonthlyJun 30, 2026Jul 30, 2026
Federal Contract AwardsDailyJun 23, 2026Jun 24, 2026
HackerNews MentionsDailyJun 23, 2026Jun 24, 2026
ProductHunt LaunchesDailyJun 23, 2026Jun 24, 2026
Podcast CompanyWeeklyJun 8, 2026Jun 22, 2026
Podcast ContactWeeklyJun 8, 2026Jun 22, 2026
SEC Form D Funding2× DailyJun 23, 2026Jun 24, 2026
New Business FormationsMonthlyMay 31, 2026Jun 30, 2026
Company DatabaseMonthlyMay 10, 2026Jul 10, 2026
Contact DatabaseMonthlyMay 10, 2026Jul 10, 2026
Data ManifestsPer delivery

Getting Started

  1. Contact [email protected] to get your service account credentials
  2. Choose which signal types you need
  3. Set up authentication and start pulling data