etl

Here are 63 public repositories matching this topic...

jupyter-naas / naas

Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)

open-source data-science data binder ai integration jupyter pipeline etl engine data-transformation jupyterlab notebooks

Updated Feb 14, 2025
Python

markwsutton / ETL-using-Python-SQL

ETL using Python in Jupyter Notebook, loading CSV, cleaning data, and saving to SQL Database.

python sql database etl csv-files

Updated Nov 17, 2020
Jupyter Notebook

EimisPacheco / Several-Jupyter-Notebooks

Jupyter Notebooks for different purposes: social network web scraping, ETL, Selenium WebDriver for web testing, automation using Python, data wrangling, data transformation, data cleaning, stock market analysis, APIs, machine learning algorithms, etc.

etl machine-learning-algorithms data-transformation data-wrangling data-cleaning stock-market-analysis selenium-python social-network-webscrapping

Updated Aug 9, 2020
Jupyter Notebook

cvilla87 / PySpark-ETL-Telecom

Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plot the results using Matplotlib.

python unix json csv spark hadoop etl jupyter-notebook pyspark hdfs sparksql matplotlib dataframe

Updated Dec 3, 2018
Jupyter Notebook

elasticlabs / airflow-jupyter-docker-compose

Orchestration of data science and earth observation models in Apache Airflow: scale up with the Celery Executor and experiment with Jupyter Notebook, all in a Docker Compose setup.

data-science airflow etl jupyter-notebook apache-airflow airflow-dags

Updated Aug 23, 2022
Python

souvik-databricks / dlt-with-debug

A lightweight helper utility that lets developers build pipelines interactively by sharing one codebase between DLT runs and non-DLT interactive notebook runs.

big-data spark etl python3 databricks dlt etl-pipeline big-data-processing delta-live-tables

Updated Dec 7, 2022
Python

cedoula / Movies-ETL

Perform the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.

python postgres json csv sql etl postgresql jupyter-notebook pandas pgadmin4 etl-framework etl-pipeline

Updated Oct 12, 2022
Jupyter Notebook

wednesday-solutions / aws-glue-jupyter-notebook-starter

A starter repository for your next AWS Glue project. It comes with complete IaC, a CD pipeline, and a reusable common SDK. Set up Jupyter Notebook for AWS Glue locally.

aws jupyter etl glue data-engineering de aws-glue jupyter-notbook

Updated Sep 6, 2023
Jupyter Notebook

paladique / codespaces-etl-basic-demo

ETL with Jupyter Notebooks, Pandas, and Azure Cosmos DB

etl azure pandas data-engineering azure-cosmos-db codespaces

Updated Oct 5, 2023
Jupyter Notebook

autistic-symposium / tensorflow-for-deep-learning-py

👾 My old deep learning notebooks (e.g., TensorFlow examples, Caffe, deep art, NumPy)

machine-learning deep-learning neural-network etl tensorflow paper curated

Updated Nov 18, 2024
Jupyter Notebook

sagarrathi / Projects

Various data analytics projects based on statistics, in notebook form.

python data-science etl statistical-models

Updated Mar 3, 2020
HTML

BinariesGoalls / Udacity-Data-Engineering-Nanodegree

A repository for the files and notebooks produced during my Udacity Data Engineering Nanodegree program.

python aws postgres airflow spark cassandra etl data-engineering data-pipelines data-modeling data-warehouses data-lakes

Updated Dec 5, 2022
PLpgSQL

farhanzafrani / Data-Science-Mentorship-Program

This repo contains all the notebooks and code for the Data Science Mentorship Program offered by the CampusX YouTube channel.

python machine-learning statistics etl numpy probability plotly eda pandas seaborn matplotlib tableau

Updated Aug 26, 2023
Jupyter Notebook

mar1boroman / databricks-patterns

Common ETL patterns and utilities for PySpark. Notebooks tested on Databricks Community Edition.

data-science spark etl pyspark data-engineering databricks etl-framework cloud-migration databricks-notebooks databricks-email databricks-etl

Updated Sep 3, 2022
Jupyter Notebook

I2DSR / data-science-ipython-notebooks

Data science encompasses a wide range of areas, topics, and sub-domains such as Big Data, Machine & Deep learning (ETL, TensorFlow, Keras), Data Mining/Visualization (EDA), BI, Predictive Analytics, Statistical Analytics, etc.

python data-science machine-learning data-mining r big-data deep-learning etl tensorflow exploratory-data-analysis keras data-visualization statistical-analysis business-intelligence predictive-analytics big-data-analytics

Updated May 3, 2024

halpeter / ETL-Project

Using data extracted from Kaggle on the top restaurants from 2020, this project uses Python scripting in Jupyter Notebook to transform and clean the data, then loads the cleaned data frames into a PostgreSQL database.

etl extract transformations load

Updated Mar 29, 2021
Jupyter Notebook

zhenghao0379 / py_etl

Python ETL demo in Jupyter Notebook

mysql python etl python3 azkaban

Updated May 22, 2020
Python

epomatti / az-databricks-etl

Sample notebooks on Azure Databricks for ETL

apache-spark etl azure terraform databricks synapse azure-databricks azure-synapse-analytics

Updated May 20, 2023
Scala

sarahrosegallagher / AWS_RDS_ETL

Jupyter Notebook ETL from AWS S3 bucket

etl aws-s3 jupyter-notebook data-analysis

Updated Jul 3, 2022
Jupyter Notebook

yso8 / data_integration_openfoodfacts

PySpark OpenFoodFacts ETL: Extracts, transforms, and loads nutrition data for balanced weekly menus with SQLite3 integration.

jupyter etl notebook pyspark openfoodfacts

Updated Feb 18, 2024
Jupyter Notebook
