Fixed

S3 Delivery: March Gap Backfill Complete + Full Mirror Architecture

Summary

Our S3 delivery pipeline experienced a 29-day sync outage from February 25 to March 26, 2026. During this period, signal data continued to be produced and delivered to GCS buckets normally, but was not propagated to S3 mirrors. This affected all S3 delivery customers.

All missing data has been fully backfilled as of April 16, 2026. We have also upgraded the sync architecture to prevent recurrence.


What happened

  • Feb 25: The GCS→S3 sync scheduler was inadvertently removed during infrastructure maintenance
  • Feb 25 – Mar 26: S3 mirrors stopped receiving updates. GCS (primary) was unaffected
  • Mar 26: Sync pipeline was restored. New data began flowing to S3 again
  • Apr 16: Full backfill of all 121 missing deliveries completed across 36 buckets (242 files)

Impacted buckets (36)

BucketMissing deliveries
autobound-10k-v14
autobound-10q-v14
autobound-20f-v11
autobound-20f-v21
autobound-6k-v13
autobound-6k-v21
autobound-8k4
autobound-conference-cfp9
autobound-earnings-transcripts3
autobound-earnings-transcripts-v21
autobound-federal-contract-award8
autobound-financials1
autobound-github-v11
autobound-glassdoor-company-v21
autobound-hackernews12
autobound-hiring-trends4
autobound-hiring-velocity-v14
autobound-linkedin-comments-contact-v11
autobound-linkedin-post-company-v21
autobound-linkedin-post-contact-v32
autobound-news-v23
autobound-news-v31
autobound-patents1
autobound-podcast-appearance17
autobound-product-reviews-v11
autobound-producthunt9
autobound-reddit-company-v21
autobound-sec-form-d-funding12
autobound-seo-traffic1
autobound-twitter-company-posts1
autobound-twitter-contact-posts1
autobound-website-intelligence-v11
autobound-work-milestones3
autobound-work-milestones-v21
autobound-youtube-company1
autobound-youtube-contact1

What we changed

  1. Full mirror architecture: S3 now mirrors every folder in every signal bucket, not just the current day. Any historical gap is automatically caught and filled on the next sync run.
  2. GCP-native scheduling: The sync trigger has been moved from an external scheduler to GCP Cloud Scheduler, eliminating the single point of failure that caused this outage.
  3. Increased capacity: Cloud Run job memory upgraded from 8GB to 16GB to handle large signal files (e.g., website-intelligence at 13.7GB).
  4. Backfill manifest: A backfill manifest (backfill-2026-04-16.json) has been uploaded to s3://autobound-s3-manifests/syncs/ documenting all recovered deliveries.

Action required

None. All missing data is now available in your S3 buckets in the same folder structure and file naming convention as regular deliveries. No changes to your ingestion pipeline are needed.

If you notice any remaining gaps, please reach out in your Slack Connect channel.