Python, Bash and SQL Essentials for Data Engineering (Duke University Coursera Specialization)

2026-04-17

Four courses covering the three languages every data engineer writes daily — Python and pandas for data manipulation, Linux and bash for the command line, Python+SQL for scripting real pipelines, and the web apps and CLI tools that wrap them. Duke University's foundations for a data engineering career.

Enroll on Coursera →

What You Will Build

Pandas-driven ETL jobs, bash automation for Linux data workflows, Python+SQL scripts for real warehouses, and the web apps and CLIs your team will actually use.

Courses in This Specialization

  1. Python and Pandas for Data Engineering — The pandas API, vectorization, and idiomatic data-engineering Python.
  2. Linux and Bash for Data Engineering — The command line and shell scripting that underlies every pipeline.
  3. Scripting with Python and SQL for Data Engineering — Bridging Python and SQL for production ETL.
  4. Web Applications and Command-Line Tools for Data Engineering — Wrapping pipelines in FastAPI, Click, and deployable tools.

Who This Is For