Schemas are sub-areas of a data warehouse that organize tables according to their use. You don’t strictly have to place tables in the right schema, but we highly recommend it, as it keeps your data well organized.
STAGE
This is where raw data is saved.
Data that has been ingested into Peak using Dock > Data Sources lands in this schema by default.
TRANSFORM
This is where you save aggregated data that is ready for modeling.
Tables in this schema have been ‘transformed’ from raw data, for example by aggregating it.
PUBLISH
This is where you store data that has been processed and cleaned, and is ready to show to business users.
Your dashboards and web apps will use data from this schema.
SANDPIT
This is where you store data that is not ready for modeling or use in apps.
It’s called Sandpit because you can play around with data in here!
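To make the four schemas concrete, here is a minimal sketch of how data might move between them. It is not Peak's implementation: it uses Python's built-in sqlite3 module as a local stand-in for the data warehouse, with one attached in-memory database per schema so that tables can be addressed as schema.table. The table names (orders, daily_sales, sales_dashboard, orders_experiment), the columns, and the sample rows are all hypothetical, and the stage to transform to publish order is an assumption based on the descriptions above.

```python
import sqlite3

# The four schema names come from this article; everything else
# (table names, columns, sample rows) is hypothetical.
SCHEMAS = ("stage", "transform", "publish", "sandpit")


def build_warehouse() -> sqlite3.Connection:
    """Create an in-memory stand-in with one attached database per schema."""
    conn = sqlite3.connect(":memory:")
    for schema in SCHEMAS:
        # SQLite has no CREATE SCHEMA statement; attaching a database gives
        # the same schema.table naming a real warehouse would use.
        conn.execute(f"ATTACH DATABASE ':memory:' AS {schema}")
    return conn


def run_example(conn: sqlite3.Connection) -> list:
    """Move hypothetical order data through the schemas; return the published rows."""
    # STAGE: raw data, as it might look straight after ingestion.
    conn.execute(
        "CREATE TABLE stage.orders (order_id INTEGER, order_date TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO stage.orders VALUES (?, ?, ?)",
        [
            (1, "2024-01-01", 10.0),
            (2, "2024-01-01", 15.5),
            (3, "2024-01-02", 7.25),
        ],
    )

    # TRANSFORM: aggregated data that is ready for modeling.
    conn.execute(
        """
        CREATE TABLE transform.daily_sales AS
        SELECT order_date, COUNT(*) AS orders, SUM(amount) AS revenue
        FROM stage.orders
        GROUP BY order_date
        """
    )

    # PUBLISH: the table a dashboard or web app would read.
    conn.execute(
        """
        CREATE TABLE publish.sales_dashboard AS
        SELECT order_date, orders, ROUND(revenue, 2) AS revenue
        FROM transform.daily_sales
        """
    )

    # SANDPIT: an experiment that is not ready for modeling or apps.
    conn.execute(
        """
        CREATE TABLE sandpit.orders_experiment AS
        SELECT * FROM stage.orders WHERE amount > 8
        """
    )

    return conn.execute(
        "SELECT order_date, orders, revenue "
        "FROM publish.sales_dashboard ORDER BY order_date"
    ).fetchall()
```

Calling run_example(build_warehouse()) returns the rows of the hypothetical publish.sales_dashboard table. In a real warehouse the schemas already exist, so only the schema-qualified table names in the SQL would carry over.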