Skip to content

YigitBCesur/Duke-DataEngineering

Repository files navigation

\Duke-DataEngineering# Python, Bash, and SQL for Data Engineering

Duke University | Specialization

This repository contains my projects, lab work, and technical notes from the Duke University Specialization. I use it to track my learning and show my data engineering skills.


🏗 Curriculum and Study Roadmap

I organized this repository based on the official four-course curriculum:

📂 [01] Python and Pandas for Data Engineering

  • Course Topics: Setting up Python environments, using VS Code and Vim, and data manipulation with Pandas (read/write data structures and files).
  • Skills I Gained: Python Programming, Pandas, Git, Data Manipulation, Software Development Tools, Data Structures, Development Environment, NumPy, Virtual Environment, Version Control, Data Analysis Software.
  • Folder: /01-Python-Pandas/

📂 [02] Linux and Bash for Data Engineering

  • Course Topics: Using Linux tools to build solutions, and developing Bash syntax to control Linux.
  • Skills I Gained: Shell Scripting, Bash (Scripting Language), Linux Commands, File Management, Command-Line Interface, Scripting, Data Manipulation, Development Environment, Remote Access Systems, Unix, Data Processing, File Systems, Data Management, Unix Commands, Scripting Languages, Unix Shell, Linux Administration.
  • Folder: /02-Linux-Bash/

📂 [03] Scripting with Python and SQL for Data Engineering

  • Course Topics: Connecting to and querying SQL databases with Python, extracting data from different sources, and using web scraping techniques.
  • Skills I Gained: Data Structures, SQL, Data Import/Export, Python Programming, Web Scraping, Scripting, Data Persistence, JSON, MySQL, Data Manipulation, Database Management, Spatial Analysis, Databases, Hypertext Markup Language (HTML), Data Capture.
  • Folder: /03-Python-SQL/

📂 [04] Web Applications and Command-Line Tools for Data Engineering

  • Course Topics: Constructing Python microservices with FastAPI, building Command-Line Tools (CLI) with Click, and using Jupyter notebooks.
  • Skills I Gained: AWS SageMaker, Jupyter, Command-Line Interface, Microservices, Package and Software Management, Containerization, CI/CD, Test Automation, Data Pipelines, Algorithms, Python Programming, Cloud Engineering, Applied Machine Learning.
  • Folder: /04-Capstone-CLI-Web/

🛠 My Development Standards

I follow these technical rules to work like a professional:

  • Organized Environments: I use a new .venv (virtual environment) for every project. This keeps my tools and libraries organized.
  • Version Control (Git): I save all my progress using Git commands and share my code on GitHub.
  • Native Linux Experience: I work using WSL2 (Ubuntu) to learn in a real-world data engineering environment.# Duke-Data-Eng-Python

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages