DataChat is an interactive web application that lets you analyze and explore your datasets using natural language. Simply upload your CSV or Excel file, and start asking questions about your data in plain English. DataChat leverages the power of Ollama (gemma:2b) for language understanding and LangChain for seamless integration with data analysis tools.
- Intuitive Chat Interface: Ask questions about your data in a conversational manner.
- Support for CSV and Excel Files: Easily upload your data from common file formats.
- Powerful Data Analysis: Get answers to questions about trends, statistics, distributions, and relationships within your data.
- Seamless Integration: Leverages LangChain to connect Ollama with data analysis tools like Pandas.
-
Prerequisites:
- Python 3.12
- Ollama installed and running (see: https://ollama.ai/)
-
Installation:
git clone [https://github.com/WizKnight/DataChat] cd DataChat pip install -r requirements.txt
-
Running the App:
streamlit run src/main.py
This will open DataChat in your web browser.
- Upload Data: Click "Select a file..." and choose your CSV or Excel file.
- Start Chatting: Type your questions about the data in the chat input box.
- Get Answers: DataChat will process your question and display the answer in the chat.
- "What is the average age of customers?"
- "How many sales were made in each region?"
- "Which product category has the highest revenue?"
- "Show me a histogram of customer ages."
- LLM: Experiment with different Ollama models or versions for improved performance.
- Data Analysis Tools: Extend DataChat by adding custom LangChain tools to integrate with other analysis libraries or APIs.
- UI/UX: Customize the Streamlit interface to match your preferences.
This project is licensed under the Apache 2.0 License.
- This project is inspired by the growing potential of large language models and their application to data analysis.
- I thank the developers of Ollama, LangChain, and Streamlit for their excellent tools and resources.