Skip to content

caogiathinh/modern-data-warehouse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Modern Data Warehouse

This project demonstrates how to build a modern data warehouse using SQL Server, including ETL processes, data modeling, and analytics. It provides sample datasets, SQL scripts, and documentation to help you set up and explore a scalable data warehouse architecture.

Features

  • End-to-end data warehouse setup
  • Sample CRM and ERP datasets
  • ETL and data loading SQL scripts
  • Modular directory structure
  • Documentation and test placeholders

Directory Structure

datasets/         # Sample source data (CRM, ERP)
docs/             # Project documentation
scripts/          # SQL scripts for database setup and ETL
tests/            # Placeholder for test scripts
LICENSE           # Project license
README.md         # Project overview and instructions

Getting Started

  1. Clone this repository.
  2. Review the sample datasets in datasets/source_crm and datasets/source_erp.
  3. Use the SQL scripts in scripts/ to initialize and load your database.
  4. Refer to the documentation in docs/ for further guidance.

Datasets

  • CRM Source (datasets/source_crm/):
    • cust_info.csv, prd_info.csv, sales_details.csv
  • ERP Source (datasets/source_erp/):
    • CUST_AZ12.csv, LOC_A101.csv, PX_CAT_G1V2.csv

Scripts

  • Database Initialization: scripts/initdatabase.sql
  • Bronze Layer: scripts/bronze/dll_bronze.sql, scripts/bronze/proc_load_bronze.sql

License

This project is licensed under the terms of the LICENSE file in this repository.

About

Building a modern data warehouse with SQL Server, including ETL processes, data modeling and analytics

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages