data-storage
Here are 65 public repositories matching this topic...
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Updated Feb 14, 2025 - Python
Time Capsule continuously captures and stores digital activities to create a comprehensive memory system. It features real-time audio recording, speech-to-text with Fast-Whisper, plugin support, database storage via Chroma, and a web interface for management. Ideal for documenting life or building digital memories.
Updated Oct 29, 2024 - Python
Scripts and config files for the secure Linux-based multiservice server on ZFS
Updated Aug 3, 2021 - Python
BitDust project source code: official public Git repository (mirror on GitHub): https://bitdust.io
Updated Apr 22, 2025 - Python
BitDust project source code, development cycle: official development Git repository (mirror on GitHub): https://bitdust.io
Updated Apr 22, 2025 - Python
Data Engine for Manual/Algo Trading: Download/Stream -> Clean -> Store. Supports Data Lakehouse Architecture. Clean Once and Forget.
Updated Apr 26, 2025 - Python
A Python library for numpy arrays that persist on disk in a format that is simple, self-documented and tool-independent, and maximizes universal readability.
Updated Nov 7, 2024 - Python
A lightweight local NoSQL database
Updated Mar 16, 2024 - Python
A simple Python package for working with ROS2 bag files
Updated Nov 14, 2023 - Python
IPython magic for simple, organized, compressed, and encrypted storage and transfer of files between notebooks.
Updated Dec 8, 2022 - Python
🛠 Containerized and configurable Airflow ETL pipeline for collecting and storing stock and cryptocurrency market data.
Updated Jan 12, 2025 - Python
Updated Dec 1, 2021 - Python
An on-disk pythonic embedded key-value store for compressed data storage and distributed data analysis
Updated Nov 4, 2024 - Python
SQL for numerical simulations.
Updated Mar 12, 2025 - Python
Python Standard Library eXtension
Updated Jun 10, 2023 - Python
A collection of programs for reading and writing data on cassette tapes with modern computers. It can also be used for storing data on other audio recording media and for transmitting data via audio cables.
Updated Dec 30, 2019 - Python
Expense Tracker is a Python-based desktop application built with Tkinter and SQLite, designed for efficient expense management. It offers intuitive features for adding, categorizing, and analyzing expenses, making it ideal for personal or small business use.
Updated Jun 23, 2024 - Python
The CNPJ Data ETL Pipeline is designed to automate the download, processing, and storage of public CNPJ data from the Brazilian Federal Revenue. The pipeline is built with Mage.ai and AWS S3 to ensure efficient data management and scalability.
Updated Feb 3, 2025 - Python
Updated Aug 11, 2017 - Python