Delivery

How to access the Autobound Signal Database via Google Cloud Storage.

We provision read access to Google Cloud Storage (GCS) buckets. You authenticate with a service account we provide and pull data on your own schedule.

Available Buckets

Each signal type has its own dedicated bucket:

BucketSignal Type
gs://autobound-job-changesJob changes
gs://autobound-hiring-trendsHiring velocity & trends
gs://autobound-glassdoorGlassdoor reviews
gs://autobound-linkedin-postsLinkedIn activity
gs://autobound-twitter-postsTwitter/X activity
gs://autobound-web-signalsWebsite intelligence
gs://autobound-seo-analysisSEO & traffic signals
gs://autobound-redditReddit mentions
gs://autobound-earningsEarnings transcripts
gs://autobound-sec-filingsSEC filings (10-K, 10-Q)
gs://autobound-patent-filingsPatent filings
gs://autobound-githubGitHub activity

Bucket Structure

Each bucket contains timestamped folders. Each folder has two files: output.jsonl and output.parquet.

gs://autobound-hiring-trends/
├── 2026-01-05T12-00-00Z/
│   ├── output.jsonl
│   └── output.parquet
├── 2026-01-04T12-00-00Z/
│   ├── output.jsonl
│   └── output.parquet
└── ...

Pull from the most recent folder to get the latest data.

Authentication

  1. We provide you with a GCP service account JSON key file
  2. Set the environment variable: export GOOGLE_APPLICATION_CREDENTIALS="/path/to/key.json"
  3. Use gsutil to access the buckets
Example: Listing and downloading files
# List available buckets
gsutil ls gs://autobound-*

# List folders in a bucket
gsutil ls gs://autobound-hiring-trends/

# Download JSONL
gsutil cp gs://autobound-hiring-trends/2026-01-05T12-00-00Z/output.jsonl ./

# Download Parquet
gsutil cp gs://autobound-hiring-trends/2026-01-05T12-00-00Z/output.parquet ./

File Formats

Both formats contain the same data. Choose based on your pipeline.

FormatFileBest For
JSONLoutput.jsonlStreaming ingestion, debugging, simple parsing
Parquetoutput.parquetData warehouses, analytics, large-scale processing
JSONL example

One signal per line:

{"signal_id":"7dfdb4b4-c0b4-4620-aca6-e7263123028e","signal_type":"job-change","signal_subtype":"jobChange","detected_at":"2025-11-14T00:00:00Z","association":"contact","company":{"name":"Harlem Stage","domain":"harlemstage.org"},"contact":{"full_name":"Lashawna Bean","email":"[email protected]","job_title":"Director Of People & Public Affairs"},"data":{"summary":"Joined Harlem Stage as Director of People & Public Affairs 2 months ago.","job_change_occurred":true,"months_since_job_change":2}}
Parquet schema
signal_id: STRING
signal_type: STRING
signal_subtype: STRING
detected_at: TIMESTAMP
association: STRING
company: STRUCT<name, domain, linkedin_url, industries, employee_count_low, employee_count_high, description>
contact: STRUCT<first_name, last_name, full_name, email, job_title, linkedin_url, city, state, country>
data: STRING (JSON-encoded)

The data field is JSON-encoded to support varying fields across signal types.


Refresh Cadence

CategoryFrequency
Job ChangesMonthly
Hiring TrendsMonthly
LinkedIn ActivityMonthly
Twitter/X ActivityMonthly
Glassdoor ReviewsMonthly
Reddit MentionsMonthly
Website IntelligenceMonthly
SEO & TrafficMonthly
GitHub ActivityMonthly
News & EventsMonthly
Tech StackMonthly
Earnings TranscriptsQuarterly
SEC FilingsQuarterly
Patent FilingsMonthly

Getting Started

  1. Contact [email protected] to get your service account credentials
  2. Choose which signal types you need
  3. Set up authentication and start pulling data