What is UpTrain? πŸ€”

Monitor and Improve your Machine Learning Models in Production

uptrainarrow-up-right

An open-source framework to observe ML applications, built for engineers

Docsarrow-up-right - Try it outarrow-up-right - Support Communityarrow-up-right - Bug Reportarrow-up-right - Feature Requestarrow-up-right

arrow-up-rightPRs Welcome arrow-up-right arrow-up-rightDocs arrow-up-rightCommunity arrow-up-rightWebsite arrow-up-rightOpen In Colabarrow-up-right

UpTrainarrow-up-right is an open-source, data-secure tool for ML practitioners to observe and refine their ML models by monitoring their performance, checking for (data) distribution shifts, and collecting edge cases to retrain them upon. It integrates seamlessly with your existing production pipelines and takes minutes to get started ⚑.

🚨Coming soon🚨

  • Label Shift - Identify drifts in your predictions. Specially useful in cases when ground truth is unavailable.

  • Model confidence interval - Confidence intervals for model predictions

  • Advanced drift detection techniques - Outlier-based drift detection methods

  • Advanced feature slicing - Ability to slice statistical properties

  • Kolmogorov-Smirnov Test - For detecting distribution shifts

  • Prediction Stability - Filter cases where model prediction is not stable.

  • Adversarial Checks - Combat adversarial attacks

And more.

Get started πŸ™Œ

You can quickly get started with Google collab herearrow-up-right.

To run it on your machine, follow the steps below:

Install the package through pip:

Run your first example:

For more info, visit our get started guide.

UpTrain in actionarrow-up-right 🎬

One of the most common use cases of ML today is language models, be it text summarization, NER, chatbots, language translation, etc. UpTrain provides ways to visualize differences in the training and real-world data via UMAP clustering of text embeddings (inferred from bert). Following are some replays from the UpTrain dashboard.

AI Explainability out-of-the-box

Live Model Performance Monitoring and Data Integrity Checks

UMAP Dimensionality Reduction and Visualization

Edge-case collection to finetune the model later

Why UpTrain πŸ€”?

Machine learning (ML) models are widely used to make critical business decisions. Still, no ML model is 100% accurate, and, further, their accuracy deteriorates over time 😣. For example, Sales prediction becomes inaccurate over time due to a shift in consumer buying habits. Additionally, due to the black box nature of ML models, it's challenging to identify and fix their problems.

UpTrain solves this. We make it easy for data scientists and ML engineers to understand where their models are going wrong and help them fix them before others complain πŸ—£οΈ.

UpTrain can be used for a wide variety of Machine learning models such as LLMs, recommendation models, prediction models, Computer vision models, etc.

We are constantly working to make UpTrain better. Want a new feature or need any integrations? Feel free to create an issuearrow-up-right or contributearrow-up-right directly to the repository.

Meme

License πŸ’»

This repo is published under Apache 2.0 license. We're currently focused on developing non-enterprise offerings that should cover most use cases. In the future, we will add a hosted version which we might charge for.

Stay Updated ☎️

We are continuously adding tons of features and use cases. Please support us by giving the project a star ⭐!

Provide feedback (Harsher the better πŸ˜‰)

We are building UpTrain in public. Help us improve by giving your feedback herearrow-up-right.

Contributors πŸ–₯️

We welcome contributions to uptrain. Please see our contribution guidearrow-up-right for details.

arrow-up-right

Last updated

Was this helpful?