- AWS S3 account
- AWS access key and secret access key
- PostHog account
Setting up an S3 bucket for PostHog:
1. In the AWS console, search for S3.
2. Click Create bucket.
3. Enter a bucket name, scroll to the bottom of the page, and click Create bucket.
4. Copy the bucket name for later use.
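
The console steps above can also be scripted. The following is a minimal sketch, assuming the third-party boto3 library is installed and AWS credentials are already configured; the guide itself only uses the console, and the helper names and the bucket name in the usage note are hypothetical.

```python
def build_create_bucket_kwargs(bucket_name: str, region: str) -> dict:
    """Build the keyword arguments for boto3's S3 create_bucket call."""
    kwargs = {"Bucket": bucket_name}
    # us-east-1 is the default location; S3 rejects it as an explicit
    # LocationConstraint, so only pass the constraint for other regions.
    if region != "us-east-1":
        kwargs["CreateBucketConfiguration"] = {"LocationConstraint": region}
    return kwargs


def create_bucket(bucket_name: str, region: str) -> None:
    """Create the bucket (requires boto3 and valid AWS credentials)."""
    import boto3  # imported lazily so the helper above works without boto3

    s3 = boto3.client("s3", region_name=region)
    s3.create_bucket(**build_create_bucket_kwargs(bucket_name, region))
```

Calling `create_bucket("my-posthog-export", "us-east-1")` would create a bucket with default settings, roughly matching what the console does when you accept the defaults. Bucket names are globally unique, so pick your own.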

Getting AWS access key and secret access key:
1. Go to the AWS console and click the name of your account. Then, in the expanded drop-down list, select Security credentials.
2. Under Access keys, click Create access key.
3. After you confirm the creation, a page appears where you can show and copy the Access key and Secret access key values.
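
If you later use these keys from a script, it is safer to read them from the environment than to paste them into code. A minimal sketch, assuming the keys are stored in the standard `AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY` environment variables; the function name is hypothetical:

```python
import os


def load_aws_credentials(env=None) -> dict:
    """Read the AWS key pair from environment variables.

    `env` defaults to os.environ; passing a plain dict makes testing easy.
    """
    env = os.environ if env is None else env
    names = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise KeyError("missing environment variables: " + ", ".join(missing))
    return {
        "aws_access_key_id": env["AWS_ACCESS_KEY_ID"],
        "aws_secret_access_key": env["AWS_SECRET_ACCESS_KEY"],
    }
```

The returned dictionary uses the parameter names that boto3 clients accept, so it can be passed as `boto3.client("s3", **load_aws_credentials())`.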

Sending PostHog data to S3
1. In PostHog, go to Data pipelines > New destination.
2. Search for S3, then click Create.
3. Fill out these fields:
   - Name: assign a name to your destination
   - Model: choose either events or persons
   - Batch interval: choose either hourly or daily
   - Bucket: name of the bucket you created on AWS
   - Region: name of the region where the bucket was created
   - Key prefix: assign a key prefix for this integration
   - Format: select JSON lines
   - AWS Access Key ID and Secret Access Key: paste the values from the previous section
4. Click the Paused button next to S3 to automatically start syncing your data, and then click Create.

You can use the Backfill batch export function located in the Runs tab to load historical data.
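
With the format set to JSON lines, each exported file holds one JSON object per line. The following is a minimal sketch of reading such a file once it has been downloaded from the bucket. The field names in the sample (`event`, `distinct_id`) are illustrative assumptions, not a statement of the exact export schema.

```python
import json


def parse_json_lines(text: str) -> list:
    """Parse JSON Lines text into a list of dictionaries."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines and a trailing newline
        records.append(json.loads(line))
    return records


# Hypothetical two-record sample standing in for an exported file.
sample = (
    '{"event": "pageview", "distinct_id": "user-1"}\n'
    '{"event": "signup", "distinct_id": "user-2"}\n'
)
records = parse_json_lines(sample)
print([record["event"] for record in records])
```

Parsing line by line keeps memory use low for large exports if you iterate over a file object instead of a full string.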


Definite Instructions
1. Go to this link to add an integration. Search for the S3 extractor and click it.
2. Fill out these fields:
   - Name: give your integration an easily identifiable name
   - Access Key: paste in the AWS Access key from the previous section
   - Access Key Secret: paste in the Secret access key from the previous section
   - Region: region where the bucket was created, written in xx-xxxxx-xx format (e.g. us-east-1 for US East (N. Virginia))
   - Bucket: name of the bucket you created on AWS
   - Dataset: assign a name to your dataset
   - Primary Key: optional
   - Path Pattern: optional
   - Format: type jsonl
   - Check the Is Posthog checkbox
   - Schedule Cron: schedule for how frequently your data syncs. You can use the default of once a day at 5AM UTC (`0 5 * * *`), or you can set your own schedule. This tool is helpful in converting your desired timing to cron syntax.
3. Click Save. You should see a confirmation that your integration was successfully created.
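
To make the cron syntax concrete, here is a small sketch that splits a five-field expression such as `0 5 * * *` into its named parts and checks whether a given time matches it. It is a simplification that only understands `*` and single integers (no ranges, lists, or steps), and it requires every restricted field to match, which differs from standard cron only when both day-of-month and day-of-week are restricted. The function names are hypothetical.

```python
from datetime import datetime

CRON_FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def parse_cron(expression: str) -> dict:
    """Map the five space-separated cron fields to their names."""
    parts = expression.split()
    if len(parts) != len(CRON_FIELDS):
        raise ValueError("expected 5 fields, got %d" % len(parts))
    return dict(zip(CRON_FIELDS, parts))


def cron_matches(expression: str, moment: datetime) -> bool:
    """Return True if `moment` satisfies the (simplified) expression."""
    actual = {
        "minute": moment.minute,
        "hour": moment.hour,
        "day_of_month": moment.day,
        "month": moment.month,
        # cron counts Sunday as 0; isoweekday() counts Sunday as 7
        "day_of_week": moment.isoweekday() % 7,
    }
    fields = parse_cron(expression)
    return all(
        spec == "*" or int(spec) == actual[name]
        for name, spec in fields.items()
    )


print(parse_cron("0 5 * * *"))
print(cron_matches("0 5 * * *", datetime(2024, 1, 1, 5, 0)))
```

Reading the default left to right: minute 0, hour 5, any day of the month, any month, any day of the week, which is once a day at 05:00.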