Delivery URIs

How to access the Autobound Signal Database via Google Cloud Storage.

We provision read access to Google Cloud Storage (GCS) buckets. You authenticate with a service account we provide and pull data on your own schedule.

Delivery Model

Each delivery is a delta. A timestamped folder contains only the signals detected since the previous delivery, not a cumulative snapshot. To build historical coverage, union the folders, working backwards from the most recent one.

Deliveries are immutable. Once a folder lands in a bucket, it's never modified. If data needs to be re-delivered for a past period, it arrives as a new timestamped folder.
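As a rough illustration of unioning deltas, the sketch below merges already-parsed folders from newest to oldest. It assumes that signal_id stays stable when a period is re-delivered, which this page does not state. The function name union_deltas is hypothetical.

```python
from typing import Iterable


def union_deltas(folders: dict[str, Iterable[dict]]) -> list[dict]:
    """Merge delta folders into one record set, walking backwards in time.

    `folders` maps a folder name such as "2026-01-31-17-30-00" to the
    signals parsed from that folder. Zero-padded names sort
    chronologically as plain strings.
    """
    seen: set[str] = set()
    merged: list[dict] = []
    for name in sorted(folders, reverse=True):  # newest folder first
        for signal in folders[name]:
            # Assumption: signal_id is stable across re-deliveries.
            signal_id = signal["signal_id"]
            if signal_id in seen:
                continue  # keep the copy from the newest folder
            seen.add(signal_id)
            merged.append(signal)
    return merged
```

Keeping the newest copy means a re-delivered folder takes precedence over the folder it supersedes. If your use case needs every copy, drop the de-duplication step.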

Bucket versions (v1, v2, v3, and so on) are schema evolutions. The underlying signal data is the same; only the field names, types, or structure changed between versions. Looking far enough back may require reading from an older bucket version with a different schema. We're standardizing schemas across all signal types in 2026.
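If you track which schema version each bucket uses, the version can be read from the bucket name. The sketch below assumes the naming pattern seen in the bucket lists on this page (gs://autobound-<signal>[-vN]/). The helper parse_bucket_uri is hypothetical, not part of any Autobound tooling.

```python
import re
from typing import NamedTuple, Optional

# Pattern inferred from the bucket URIs listed on this page (an assumption).
_BUCKET_RE = re.compile(r"^gs://autobound-(?P<signal>.+?)(?:-v(?P<version>\d+))?/?$")


class BucketName(NamedTuple):
    signal: str
    version: Optional[int]  # None when the bucket name has no -vN suffix


def parse_bucket_uri(uri: str) -> BucketName:
    """Split a bucket URI into its signal name and schema version."""
    match = _BUCKET_RE.match(uri)
    if match is None:
        raise ValueError(f"unrecognized bucket URI: {uri!r}")
    version = match.group("version")
    return BucketName(match.group("signal"), int(version) if version else None)
```

For example, parse_bucket_uri("gs://autobound-news-v4/") yields the signal "news" with version 4, while an unversioned bucket such as gs://autobound-8k/ yields a version of None.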

Backfill folders are historical deltas. They are delivered at onboarding to provide coverage for time periods before your contract started.

Available Buckets

Each signal type has its own dedicated bucket. Access each bucket directly by its URI.

SEC Filings

| Bucket | Description |
| --- | --- |
| gs://autobound-10k-v1/ | SEC 10-K annual filings |
| gs://autobound-10q-v1/ | SEC 10-Q quarterly filings |
| gs://autobound-20f-v2/ | SEC 20-F foreign company filings |
| gs://autobound-6k-v2/ | SEC 6-K foreign company reports |
| gs://autobound-8k/ | SEC 8-K current reports |
| gs://autobound-earnings-transcripts-v2/ | Earnings call transcripts |

Social & Web Signals

| Bucket | Description |
| --- | --- |
| gs://autobound-linkedin-post-company-v2/ | LinkedIn posts (company-level) |
| gs://autobound-linkedin-post-contact-v3/ | LinkedIn posts (contact-level) |
| gs://autobound-linkedin-comments-contact-v2/ | LinkedIn comments (contact-level) |
| gs://autobound-glassdoor-company-v2/ | Glassdoor company reviews |
| gs://autobound-reddit-company-v2/ | Reddit mentions (company-level) |
| gs://autobound-twitter-company-posts-v2/ | Twitter/X posts (company-level) |
| gs://autobound-twitter-contact-posts-v2/ | Twitter/X posts (contact-level) |
| gs://autobound-youtube-company-v2/ | YouTube activity (company-level) |
| gs://autobound-youtube-contact-v2/ | YouTube activity (contact-level) |

New Signals (Q1 2026)

| Bucket | Description |
| --- | --- |
| gs://autobound-sec-form-d-funding/ | SEC Form D funding filings |
| gs://autobound-new-business-formation/ | New business formations |
| gs://autobound-federal-contract-award/ | Federal contract awards |
| gs://autobound-conference/ | Conference speakers |
| gs://autobound-hackernews/ | HackerNews signals |
| gs://autobound-producthunt/ | ProductHunt launches |
| gs://autobound-podcast-appearance/ | Podcast appearances |

Company Intelligence

| Bucket | Description |
| --- | --- |
| gs://autobound-news-v4/ | News signals |
| gs://autobound-hiring-trends/ | Hiring trends |
| gs://autobound-hiring-velocity-v1/ | Hiring velocity |
| gs://autobound-employee-growth-v2/ | Employee growth signals |
| gs://autobound-github-v1/ | GitHub activity |
| gs://autobound-product-reviews-v2/ | Product reviews (G2) |
| gs://autobound-patents-v2/ | Patent filings (v2) |
| gs://autobound-seo-traffic-v2/ | SEO & traffic signals |
| gs://autobound-website-intelligence-v1/ | Website intelligence |
| gs://autobound-work-milestones-v2/ | Work milestones |
| gs://autobound-financials/ | Financial data |
| gs://autobound-tech-used/ | Technology stack |
| gs://autobound-intent/ | Intent signals |

Reference Data

| Bucket | Description |
| --- | --- |
| gs://autobound-company-database/ | Company database |
| gs://autobound-contact-database/ | Contact database |
| gs://autobound-manifests/ | Data manifests |

Bucket Structure

Each bucket contains timestamped folders. Each folder has two files: output.jsonl and output.parquet.

gs://autobound-news-v4/
├── 2026-01-31-17-30-00/
│   ├── output.jsonl
│   └── output.parquet
├── 2025-12-31-17-30-00/
│   ├── output.jsonl
│   └── output.parquet
└── ...

Pull the most recent folder to get the latest data. Because each folder is a delta, it holds only the signals detected since the previous delivery.
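A minimal sketch of picking folders by timestamp is shown below. It assumes folder names always follow the YYYY-MM-DD-HH-MM-SS pattern seen in the tree above. The timezone is not stated on this page, so the parsed values are left as naive datetimes. The checkpoint argument (the last folder you already processed) is a hypothetical convention for incremental pulls.

```python
from datetime import datetime
from typing import Iterable, Optional

FOLDER_FORMAT = "%Y-%m-%d-%H-%M-%S"  # e.g. 2026-01-31-17-30-00


def parse_folder(name: str) -> datetime:
    """Parse a folder name, tolerating the trailing slash that listings add."""
    return datetime.strptime(name.strip("/"), FOLDER_FORMAT)


def latest_folder(names: Iterable[str]) -> str:
    """Return the folder with the most recent timestamp."""
    return max(names, key=parse_folder)


def new_folders(names: Iterable[str], checkpoint: Optional[str]) -> list[str]:
    """Return folders newer than `checkpoint`, oldest first.

    Pass None as the checkpoint on the first run to get every folder.
    """
    ordered = sorted(names, key=parse_folder)
    if checkpoint is None:
        return ordered
    cutoff = parse_folder(checkpoint)
    return [name for name in ordered if parse_folder(name) > cutoff]
```

Processing folders oldest first and saving the checkpoint after each one lets an interrupted run resume without skipping a delta.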

Authentication

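  1. We provide you with a GCP service account JSON key file
  2. Set the environment variable: export GOOGLE_APPLICATION_CREDENTIALS="/path/to/key.json"
  3. Use gsutil to access the buckets. If gsutil does not pick up the key from the environment variable, activate it first: gcloud auth activate-service-account --key-file="/path/to/key.json"

Example: Listing and downloading files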
# List folders in a bucket (access each bucket directly by URI)
gsutil ls gs://autobound-news-v4/

# Download JSONL
gsutil cp gs://autobound-news-v4/2026-01-31-17-30-00/output.jsonl ./

# Download Parquet
gsutil cp gs://autobound-news-v4/2026-01-31-17-30-00/output.parquet ./

Tip: You can discover all available signal buckets by running:

gsutil ls -p autobound-signal-delivery

This returns the names of all signal buckets in the project, including buckets you are not licensed for. You can browse bucket names and folder timestamps freely, but reading the files inside a bucket requires the objectViewer role on that specific bucket, which is granted during onboarding. Unlicensed buckets return AccessDeniedException: 403 when you attempt to read their contents.
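If you would rather pull data from Python than from gsutil, the sketch below shows one possible equivalent. It uses the google-cloud-storage client library, which reads GOOGLE_APPLICATION_CREDENTIALS, and treats a 403 as "not licensed". The helper names object_name, list_folders, and download are hypothetical. Bucket names are passed without the gs:// prefix.

```python
def object_name(folder: str, fmt: str = "jsonl") -> str:
    """Build an object path such as '2026-01-31-17-30-00/output.jsonl'."""
    if fmt not in ("jsonl", "parquet"):
        raise ValueError(f"unsupported format: {fmt!r}")
    return f"{folder.strip('/')}/output.{fmt}"


def list_folders(bucket_name: str) -> list[str]:
    """List the top-level timestamped folders in a bucket."""
    from google.cloud import storage  # pip install google-cloud-storage

    client = storage.Client()
    iterator = client.list_blobs(bucket_name, delimiter="/")
    for _ in iterator:  # prefixes are populated only as pages are consumed
        pass
    return sorted(iterator.prefixes)


def download(bucket_name: str, folder: str, fmt: str, dest: str) -> bool:
    """Download one delivery file; return False if the bucket is not licensed."""
    from google.api_core.exceptions import Forbidden
    from google.cloud import storage

    blob = storage.Client().bucket(bucket_name).blob(object_name(folder, fmt))
    try:
        blob.download_to_filename(dest)
    except Forbidden:  # HTTP 403, as described for unlicensed buckets
        return False
    return True
```

A call such as download("autobound-news-v4", "2026-01-31-17-30-00", "jsonl", "output.jsonl") mirrors the gsutil cp example above.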


File Formats

Both formats contain the same data. Choose based on your pipeline.

| Format | File | Best For |
| --- | --- | --- |
| JSONL | output.jsonl | Streaming ingestion, debugging, simple parsing |
| Parquet | output.parquet | Data warehouses, analytics, large-scale processing |

JSONL example

One signal per line:

{"signal_id":"7dfdb4b4-c0b4-4620-aca6-e7263123028e","signal_type":"news","detected_at":"2026-01-15T10:30:00Z","association":"company","company":{"name":"Acme Corp","domain":"acme.com"},"data":{"summary":"Acme Corp announces expansion into European markets...","source_url":"https://example.com/news/acme-expansion"}}
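Because each line is a complete JSON object, the file can be streamed without loading it into memory. A minimal sketch using only the standard library follows. The function name iter_signals is hypothetical, and skipping blank lines is a defensive assumption rather than documented behavior.

```python
import json
from typing import Iterable, Iterator


def iter_signals(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed signal per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:  # tolerate a trailing blank line
            yield json.loads(line)
```

To use it on a downloaded file, open the file in text mode and pass the handle directly, for example: with open("output.jsonl", encoding="utf-8") as fh, then loop over iter_signals(fh) and read fields such as signal["company"]["domain"].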
Parquet schema
signal_id: STRING
signal_type: STRING
signal_subtype: STRING
detected_at: TIMESTAMP
association: STRING
company: STRUCT<name, domain, linkedin_url, industries, employee_count_low, employee_count_high, description>
contact: STRUCT<first_name, last_name, name, email, job_title, linkedin_url, city, state, country>
data: STRING (JSON-encoded)

The data field is JSON-encoded to support varying fields across signal types.
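One consequence is that Parquet readers see data as a string, while the JSONL example above shows it as a nested object. The sketch below normalizes both cases to a dict. It assumes pyarrow is installed for the Parquet path. The helper names decode_data and read_parquet_records are hypothetical.

```python
import json
from typing import Any


def decode_data(record: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of `record` whose `data` field is parsed if it is a string."""
    data = record.get("data")
    if isinstance(data, str):
        return {**record, "data": json.loads(data)}
    return dict(record)  # already decoded (JSONL) or missing


def read_parquet_records(path: str) -> list[dict[str, Any]]:
    """Load a Parquet delivery into dicts with `data` decoded."""
    import pyarrow.parquet as pq  # pip install pyarrow

    return [decode_data(row) for row in pq.read_table(path).to_pylist()]
```

This loads the whole file into memory, which is fine for a sketch. For large deliveries, decoding the data column inside your warehouse or query engine is likely a better fit.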


Refresh Cadence

June 2026 Update: News signals are transitioning from weekly to daily deliveries in early June 2026. During the transition, weekly and daily deliveries run in parallel.

| Category | Frequency | Last Delivered | Next Delivery |
| --- | --- | --- | --- |
| SEC 10-K Annual Filings | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| SEC 10-Q Quarterly Filings | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| SEC 8-K Current Reports | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| SEC 6-K Foreign Reports | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| SEC 20-F Foreign Filings | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| Earnings Transcripts | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| Financial Fundamentals | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| News | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| Hiring Trends | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| Hiring Velocity | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| Work Milestones | Weekly | Jun 2, 2026 | Jun 9, 2026 |
| LinkedIn Posts (Company) | Monthly | May 12, 2026 | Jun 12, 2026 |
| LinkedIn Posts (Contact) | Bi-weekly | May 25, 2026 | Jun 8, 2026 |
| LinkedIn Comments (Contact) | Monthly | May 12, 2026 | Jun 12, 2026 |
| Glassdoor Reviews | Monthly | May 12, 2026 | Jun 12, 2026 |
| Reddit Mentions (Company) | Monthly | May 25, 2026 | Jun 24, 2026 |
| Twitter/X Posts (Company) | Monthly | May 12, 2026 | Jun 12, 2026 |
| Twitter/X Posts (Contact) | Monthly | May 12, 2026 | Jun 12, 2026 |
| YouTube Activity (Company) | Monthly | May 12, 2026 | Jun 12, 2026 |
| YouTube Activity (Contact) | Monthly | May 12, 2026 | Jun 12, 2026 |
| GitHub Activity | Monthly | May 12, 2026 | Jun 12, 2026 |
| Product Reviews (G2) | Monthly | May 12, 2026 | Jun 12, 2026 |
| SEO & Traffic | Monthly | May 25, 2026 | Jun 24, 2026 |
| Patent Filings | Monthly | Jun 2, 2026 | Jul 1, 2026 |
| Website Intelligence | Monthly | Jun 2, 2026 | Jul 1, 2026 |
| Employee Growth | Quarterly | Apr 15, 2026 | Jul 15, 2026 |
| Conference Speakers | Monthly | May 10, 2026 | Jun 10, 2026 |
| Federal Contract Awards | Daily | Jun 2, 2026 | Jun 3, 2026 |
| HackerNews Mentions | Daily | Jun 2, 2026 | Jun 3, 2026 |
| ProductHunt Launches | Daily | Jun 2, 2026 | Jun 3, 2026 |
| Podcast Appearances | Monthly | Apr 24, 2026 | Jun 3, 2026 |
| SEC Form D Funding | 2× Daily | Jun 2, 2026 | Jun 3, 2026 |
| New Business Formations | Monthly | Apr 1, 2026 | Jul 1, 2026 |
| Company Database | Monthly | May 10, 2026 | Jun 10, 2026 |
| Contact Database | Monthly | May 10, 2026 | Jun 10, 2026 |
| Data Manifests | Per delivery | | |

Getting Started

  1. Contact [email protected] to get your service account credentials
  2. Choose which signal types you need
  3. Set up authentication and start pulling data