"This project is a data warehouse designed to integrate data from multiple sources (CRM, ERP) into a structured format for analysis. The architecture follows the Bronze-Silver-Gold layered approach, ensuring data quality and accessibility.
- CRM System (Customer-related data)
- ERP System (Enterprise resource data)
- Extraction: Data is ingested from multiple sources.
- Transformation: Data is cleaned, deduplicated, and enriched.
- Loading: Processed data is stored in analytical tables for reporting.
This project follows the Bronze-Silver-Gold architecture
| Layer | Purpose |
|---|---|
| Bronze | Raw data ingestion from CSV Files into SQL Server Database. |
| Silver | Cleaned, Normalized, structured data |
| Gold | Optimized fact & dimension tables for reporting |
-
Database: PostgreSQL
-
ETL Pipeline: SQL-based transformations and stored procedures
-
Tools Used: SQL, draw.io.
✅ Automated ETL pipeline for seamless data processing
✅ Data quality checks & validation
✅ Optimized for analytics & dashboard reporting
✅ Scalable architecture for handling large datasets
Clone the repository:
git clone https://github.com/SQL_Data_Warehouse_Project.git
cd SQL_Data_Warehouse_ProjectRun the SQL scripts in your database:
-- Example command to execute SQL script
psql -U your_user -d your_database -f etl_pipeline.sql- Improve query performance with partitioning & indexing
- Add real-time streaming data ingestion
- Enhance data governance with role-based access control (RBAC)
data-warehouse-project/
│
├── datasets/ # Raw datasets used for the project (ERP and CRM data)
│
├── docs/ # Project documentation and architecture details
│ ├── architecture.drawio # Draw.io file shows the project's architecture
│ ├── data_catalog.md # Catalog of datasets, including field descriptions and metadata
│ ├── data_flow.drawio # Draw.io file for the data flow diagram
│ ├── data_mart.drawio # Draw.io file for data models (star schema)
│ ├── naming-conventions.md # Consistent naming guidelines for tables, columns, and files
│
├── scripts/ # SQL scripts for ETL and transformations
│ ├── bronze/ # Scripts for extracting and loading raw data
│ ├── silver/ # Scripts for cleaning and transforming data
│ ├── gold/ # Scripts for creating analytical models
│
├── tests/ # Test scripts and quality files
│
├── README.md # Project overview and instructions
├── .gitignore # Files and directories to be ignored by Git
└── requirements.txt # Dependencies and requirements for the project
The Analtyics Repository serves as a continuation of this project.

