To connect Definite to PostHog using S3, you need the following:

  1. AWS S3 account
  2. AWS access key and secret access key
  3. PostHog account

Setting up an S3 bucket for PostHog:

  1. In the AWS console, search for S3.
  2. Click Create bucket.
  3. Enter a bucket name, scroll to the bottom of the page, and click Create bucket.
  4. Copy the bucket name for later use.
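
Bucket creation fails if the name breaks the S3 naming rules, so it can help to check a candidate name first. The following is a minimal sketch covering only the core rules for general-purpose buckets (3 to 63 characters; lowercase letters, digits, dots, and hyphens; starts and ends with a letter or digit; no adjacent dots; not shaped like an IP address). The function name `is_valid_bucket_name` and the sample bucket name are hypothetical, and the check does not cover reserved prefixes or suffixes.

```python
import re

# Allowed characters and length: 3-63 chars, lowercase letters, digits,
# dots, and hyphens, beginning and ending with a letter or digit.
_BUCKET_PATTERN = re.compile(r"[a-z0-9][a-z0-9.-]{1,61}[a-z0-9]")
# Names shaped like an IPv4 address (e.g. 192.168.1.1) are not allowed.
_IP_PATTERN = re.compile(r"\d{1,3}(\.\d{1,3}){3}")


def is_valid_bucket_name(name: str) -> bool:
    """Return True if `name` passes the core S3 bucket naming rules."""
    if not _BUCKET_PATTERN.fullmatch(name):
        return False
    if ".." in name:  # two adjacent periods are not allowed
        return False
    if _IP_PATTERN.fullmatch(name):
        return False
    return True


# Hypothetical bucket name, used for illustration only.
print(is_valid_bucket_name("posthog-export-demo"))
```

A name such as `PostHog_Export` would be rejected by this check because it contains uppercase letters and an underscore.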

Getting AWS access key and secret access key:

  1. Go to the AWS console and click on the name of your account. Then, in the expanded drop-down list, select Security credentials.
  2. Under Access keys, click Create access key.
  3. After you confirm the creation, AWS shows a page where you can reveal and copy the Access key and Secret access key values.

Sending PostHog data to S3

  1. In PostHog, go to Data pipelines > New destination.
  2. Search for S3, then click Create.
  3. Fill out these fields:

  • Name : assign a name to your destination
  • Model : choose either events or persons
  • Batch interval : choose either hourly or daily
  • Bucket : name of the bucket you created on AWS
  • Region : name of the region where the bucket was created
  • Key prefix : assign a key-prefix for this integration
  • Format : select JSON lines
  • AWS Access Key ID and Secret Access Key : paste values from the previous section

Untick the Paused button next to S3 to automatically start syncing your data and then click Create.

You can use the Backfill batch export function located in the Runs tab to load historical data.
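
The JSON lines format selected above stores one JSON object per line, which is why Definite is later told to read the files as `jsonl`. As a rough illustration, the sketch below parses such content with the Python standard library. The field names `event` and `distinct_id` in the sample are assumptions chosen for illustration and are not meant to describe the exact schema of a PostHog export; `parse_json_lines` is a hypothetical helper.

```python
import json


def parse_json_lines(text: str) -> list:
    """Parse JSON lines text: one JSON object per non-empty line."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        records.append(json.loads(line))
    return records


# Illustrative sample only; real export files will carry more fields.
sample = (
    '{"event": "pageview", "distinct_id": "user-1"}\n'
    '{"event": "signup", "distinct_id": "user-2"}\n'
)

events = parse_json_lines(sample)
print(len(events), events[0]["event"])
```

Because every line is a complete JSON object, a file can be processed one record at a time without loading it whole.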


Please refer to this guide for more information about PostHog S3 destination for batch exports.

Definite Instructions

  1. Go to this link to add an integration. Search for the S3 extractor and click it.
  2. Fill out these fields:

  • Name : Give your integration an easily identifiable name
  • Access Key ID : paste in the AWS Access key from the previous section
  • Access Key Secret : paste in the Secret access key from the previous section
  • Region : region where the bucket was created written in xx-xxxxx-xx format (e.g. us-east-1 for US East (N. Virginia))
  • Bucket : name of the bucket you created on AWS
  • Dataset : assign a name to your dataset
  • Primary Key : optional
  • Path Pattern : optional
  • Format : type jsonl
  • Check the Is Posthog checkbox
  • Schedule Cron : Schedule for how frequently your data syncs. You can use the default of once a day at 5 AM UTC (`0 5 * * *`) or you can set your own schedule. This tool is helpful in converting your desired timing to cron syntax.
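
The Region field expects the region code (for example `us-east-1`), not the display name shown in the AWS console. The sketch below is a simple shape check, assuming region codes follow the pattern of a two-letter prefix, one or more lowercase words, and a trailing number. It does not confirm that a region actually exists, and `looks_like_region_code` is a hypothetical helper name.

```python
import re

# Shape only: two lowercase letters, one or more "-word" parts, then "-<number>".
_REGION_PATTERN = re.compile(r"[a-z]{2}(-[a-z]+)+-\d+")


def looks_like_region_code(value: str) -> bool:
    """Return True if `value` has the general shape of an AWS region code."""
    return _REGION_PATTERN.fullmatch(value) is not None


print(looks_like_region_code("us-east-1"))
print(looks_like_region_code("US East (N. Virginia)"))
```

The first call accepts the region code, while the second rejects the display name, which is the most common mistake when filling in this field.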

Click Save. You should see a confirmation that your integration was successfully created.
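
For reference, the default schedule `0 5 * * *` is a standard five-field cron expression. The sketch below splits an expression into its named fields to show how the default reads as "minute 0 of hour 5, every day". It only labels the fields and does not validate value ranges or compute run times; `describe_cron` is a hypothetical helper, not part of Definite.

```python
# Order of fields in a standard five-field cron expression.
CRON_FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def describe_cron(expression: str) -> dict:
    """Map each part of a five-field cron expression to its field name."""
    parts = expression.split()
    if len(parts) != len(CRON_FIELDS):
        raise ValueError(
            f"expected {len(CRON_FIELDS)} fields, got {len(parts)}: {expression!r}"
        )
    return dict(zip(CRON_FIELDS, parts))


schedule = describe_cron("0 5 * * *")
for field, value in schedule.items():
    print(f"{field}: {value}")
```

Changing only the hour field, for example to `0 17 * * *`, would move the daily sync to 5 PM UTC.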