Onboarding Guide

Get up and running with Autobound Signal Data in under 30 minutes.

Welcome to Autobound Signal Data! This guide covers technical setup and working with our team.


Onboarding Timeline

WeekMilestone
Week 1Kickoff call, credentials delivered, Slack Connect setup
Week 2First data pull, schema review, pipeline built
Week 3-4Signals matched to your records, first users enabled
OngoingRegular check-in calls as needed

Onboarding Checklist

Team Setup

Task
☐ Join Slack Connect channel with Autobound
☐ Schedule kickoff call (30 min)
☐ Identify project team (see roles)

Technical Setup

TaskResources
☐ Receive and activate GCP credentialsSetup instructions
☐ Verify bucket accessDelivery docs
☐ Download first data fileBucket structure
☐ Parse and validate schemaSchema docs
☐ Match signals to your recordsEntity resolution

Go-Live

TaskResources
☐ Set up automated refresh pipelineDelivery docs
☐ Surface signals to end users
☐ Schedule 30-day review call

Who Should Be Involved

RoleResponsibility
Data EngineerBuild ETL pipeline, manage GCS access
Product ManagerDefine signal use cases, prioritize integration
Sales/Marketing LeadershipValidate data quality and fit for target use cases

Kickoff attendees: Data Engineer + Product Manager recommended.


Working with Autobound

Slack Connect

Your shared channel is for:

  • Quick questions (response within 4 hours)
  • Data quality issues
  • Schema clarifications

Success Sessions

30-minute check-ins, 2-3x per month for the first 1-2 months:

  • Review pipeline progress
  • Troubleshoot integration issues
  • Optimize match rates
  • Share best practices

After go-live, we move to monthly or as-needed.


Core Milestones

Week 1: Access & Kickoff

  • Kickoff call completed
  • Credentials received and tested
  • Slack Connect active

Week 2: First Data Pull

  • Downloaded data from at least one bucket
  • Validated schema fields
  • Initial pipeline created

Week 4: Review & Optimize

  • Match rate calculated
  • Signal taxonomy defined for your use case
  • 30-day review scheduled

Technical Setup Details

Authentication

# Activate service account
gcloud auth activate-service-account --key-file=~/credentials/autobound-key.json

# Verify access by downloading from one of your licensed buckets
gcloud storage cp gs://autobound-10k/2026-01-15T00-00-00Z/output.parquet ./test-download.parquet
💡

Note: Bucket listing is not supported. Use the direct bucket URLs from the Available Buckets section to verify access.

Bucket Structure

Each signal type has its own dedicated bucket with timestamped folders:

gs://autobound-{signal-type}/
├── 2026-01-05T12-00-00Z/    ← Most recent
│   ├── output.jsonl
│   └── output.parquet

Always pull from the most recent folder. See Delivery docs for details.

Available Buckets

⚠️

Important: Bucket listing is not supported. Use the direct bucket URLs below to access your licensed signal types.

Company Signals:

Signal TypeBucket URL
SEC 10-K Filingsgs://autobound-10k/
SEC 10-Q Filingsgs://autobound-10q/
SEC 8-K Filingsgs://autobound-8k/
SEC 20-F Filingsgs://autobound-20f/
SEC 6-K Filingsgs://autobound-6k/
Reddit (Company)gs://autobound-reddit-company/
Glassdoor Reviewsgs://autobound-glassdoor-company/
LinkedIn Posts (Company)gs://autobound-linkedin-post-company/
YouTube (Company)gs://autobound-youtube-company/
X/Twitter (Company)gs://autobound-x-company/
GitHub Activitygs://autobound-github/
SEO Trafficgs://autobound-seo-traffic/
News Eventsgs://autobound-news/
Hiring Velocitygs://autobound-hiring-velocity/
Hiring Trendsgs://autobound-hiring-trends/
Earnings Transcriptsgs://autobound-earnings-transcripts/
Company Financialsgs://autobound-financials/
Product Reviewsgs://autobound-product-reviews/
Website Intelligencegs://autobound-website-intelligence/
Employee Growthgs://autobound-employee-growth/
Patentsgs://autobound-patents/
Tech Usedgs://autobound-tech-used/
Company Databasegs://autobound-company-database/

Contact Signals:

Signal TypeBucket URL
LinkedIn Posts (Contact)gs://autobound-linkedin-post-contact/
LinkedIn Comments (Contact)gs://autobound-linkedin-comments-contact/
X/Twitter (Contact)gs://autobound-x-contact/
YouTube (Contact)gs://autobound-youtube-contact/
Work Milestonesgs://autobound-work-milestones/
Contact Databasegs://autobound-contact-database/

Download Data

# Download from a specific bucket (replace with your licensed bucket and timestamp)
gcloud storage cp gs://autobound-10k/2026-01-05T12-00-00Z/output.parquet ./
💡

Tip: Your account manager will provide the specific bucket URLs and timestamps for your licensed data.

FormatBest For
JSONLStreaming, debugging
ParquetData warehouses, analytics

Schema Overview

{
  "signal_id": "uuid",
  "signal_type": "10k",
  "signal_subtype": "aiInvestment",
  "detected_at": "2026-01-05T12:00:00Z",
  "company": {
    "name": "Acme Corp",
    "domain": "acme.com"
  },
  "data": {
    "summary": "...",
    "relevance": 0.85
  }
}
Schema Overview

High-level schema structure

💡

Full field reference: For complete field definitions, see the individual signal pages under Company Insights or Contact Insights.


Matching Signals

PriorityJoin Key
1stcompany.domain
2ndcompany.linkedin_url
3rdcompany.name (fuzzy)
Entity Resolution

Best practices for matching


Key Documentation

ResourceLink
Schema/docs/schema
Entity Resolution/docs/resolution
Delivery & Buckets/docs/delivery
Signal Catalog/docs/signal-catalog
Sample Files/docs/files
Status & Known Issues/docs/status
Roadmap/docs/roadmap
ChangelogChangelog

What’s Next

Learn more about our signal schema and resolution methods.