Stars
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
NannyML: post-deployment data science in Python
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
relplot: Utilities for measuring calibration and plotting reliability diagrams
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Open source platform for the machine learning lifecycle
Open-Source Web UI for Apache Kafka Management
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Streamlit — A faster way to build and share data apps.
A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.
A native Rust library for Delta Lake, with bindings into Python
Amazon SageMaker Local Mode Examples
Open standard for machine learning interoperability
🦜🔗 Build context-aware reasoning applications
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
The fundamental package for scientific computing with Python.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Literature references for “Designing Data-Intensive Applications”
scikit-learn: machine learning in Python