cleanlab-tools

Cookbooks showcasing various applications of Cleanlab, along with code shared for education, reproducibility, and transparency.

Detecting LLM Errors with Cleanlab's Trustworthy Language Model

Example	Description
TLM-Demo-Notebook	Demonstrates various applications of the Trustworthy Language Model, particularly in customer support
tlm_call_api_directly	Call the TLM REST API directly. You can use any programming language (e.g. TypeScript) with an HTTP library or tool by providing the necessary payload and headers.
Trustworthy RAG with LlamaIndex	Run Cleanlab in RAG apps built with LlamaIndex for real-time detection of incorrect responses and root cause analysis.
Trustworthy RAG with MongoDB	Run Cleanlab in RAG apps built with MongoDB for real-time detection of incorrect responses and root cause analysis.
Customer Support AI Agent with NeMo Guardrails	Build a reliable customer support AI agent with NeMo Guardrails and trustworthiness scoring (NVIDIA blog post)
Better LLM Evals in MLflow	Automatically find the bad LLM responses lurking in your production logs/traces via trustworthiness scoring in MLflow
TLM-PII-Detection	Find and mask PII with the Trustworthy Language Model
Detecting GDPR Violations with TLM	Analyze application logs using TLM to detect GDPR violations
TLM-Record-Matching	Use the Trustworthy Language Model to reliably match records between two different data tables
fine_tuning_data_curation	Automatically detect bad data in instruction-tuning (LLM fine-tuning) datasets
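
The tlm_call_api_directly entry above describes calling the TLM REST API with a payload and headers. As a rough sketch of what that involves (not the actual API contract), the snippet below builds a JSON POST request using only the Python standard library. The URL, the `Authorization` header scheme, the `prompt` payload field, and the helper name `build_tlm_request` are all placeholders assumed for illustration; consult the tlm_call_api_directly example for the real endpoint, headers, and payload.

```python
import json
import urllib.request

# Hypothetical placeholders: NOT the real TLM endpoint or credentials.
TLM_URL = "https://example.invalid/tlm/prompt"
API_KEY = "YOUR_API_KEY"


def build_tlm_request(prompt: str, url: str = TLM_URL, api_key: str = API_KEY) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request carrying a prompt."""
    # The payload field name is an assumption made for this sketch.
    payload = json.dumps({"prompt": prompt}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        # The header name and scheme are assumptions made for this sketch.
        "Authorization": f"Bearer {api_key}",
    }
    return urllib.request.Request(url, data=payload, headers=headers, method="POST")


request = build_tlm_request("What is the capital of France?")
print(request.get_method(), request.full_url)
```

Once the placeholders are replaced with real values, the request could be sent with `urllib.request.urlopen(request)`. The same payload-plus-headers shape carries over to any other language with an HTTP client.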

Data Curation with Cleanlab Studio

Example	Description
few_shot_prompt_selection	Clean the pool of few-shot examples to improve the prompt template for an OpenAI LLM
fine_tuning_classification	Use Cleanlab Studio to improve the accuracy of fine-tuned LLMs for classification tasks
fine_tuning_mistral_beavertails	Analyze human-annotated AI-safety-related labels (like toxicity) using Cleanlab Studio, and thus generate safer responses from LLMs
Evaluating_Toxicity_Datasets_Large_Language_Models	Analyze toxicity annotations in the Jigsaw dataset using Cleanlab Studio
time_series_automl	Model time series data in a tabular format and use Cleanlab Studio AutoML to achieve high prediction accuracy

Miscellaneous Code

Example	Description
TLM-SimpleQA-Benchmark	Benchmark TLM and OpenAI LLMs on the SimpleQA dataset
benchmarking_hallucination_metrics	Evaluate the performance of popular real-time hallucination detection methods on RAG benchmarks
benchmarking_hallucination_model	Evaluate the performance of popular hallucination detection models on RAG benchmarks
gpt4-rag-logprobs	Obtain logprobs from a GPT-4-based RAG system
generate_llm_response	Generate LLM responses for customer service requests using Llama 2 and OpenAI's API
