Data Engineering (2/1 Credits)
Learning Outcomes:
On successful completion of this course, student will be able to: LO1 – define the role of data engineering within the modern data ecosystem and its relationship to analytics and data science; LO2 – describe the fundamental concepts of data architecture, data pipelines, and the data lifecycle from ingestion to consumption; LO3 – differentiate between various data integration and transformation methods, including ETL and ELT approaches; LO4 – apply conceptual models to design simple data pipeline architectures that ensure data quality and reliability; LO5 – evaluate the suitability of different data tools and frameworks for specific business or analytical use cases.
Topics:
- Introduction to Data Engineering;
- Overview of Data Warehousing, Dimensional Modeling, and Pentaho Data Integration (Extract);
- Pentaho Data Integration (Load and Transform);
- The Data Engineering Lifecycle;
- Designing Good Data Architecture;
- Deployment: Data Visualization using Power BI ;
- Assignment ;
- Choosing Technologies Across the Data Lifecycle + Pentaho Intro ;
- Data Generation in Source Systems;
- Apache Airflow: Job Orchestration ;
- Review Material;
- Data Storage Concepts ;
- Data Ingestion;
- Queries, Modeling, and Transformation;
- Serving Data for Analytics, ML, and Reverse ETL;
- Security and Privacy in Data Engineering;
- The Future of Data Engineering;
- Mini Project Workshop;
- Integration.
Published at :
SOCIAL MEDIA
Let’s relentlessly connected and get caught up each other.
Looking for tweets ...