Back to Browse

Explainable Data Drift for NLP

327 views
Oct 9, 2023
29:31

Install NLP Libraries https://www.johnsnowlabs.com/install/ Watch all NLP Summit 2023 sessions: https://www.nlpsummit.org/nlp-summit-2023-watch-now/ Detecting data drift, although far from solved for Tabular data, has become a common practice as a way to monitor ML models in production. For Natural Language Processing on the other hand the question remains mostly open. In this talk, we will present and compare two approaches. First, we will demonstrate how by extracting a wide range of explainable properties per document such as topics, language, sentiment, named entities, keywords and more we are able to explore potential sources of drift. We will show how these properties can be consistently tracked over time, how they can be used to detect meaningful Data Drift as soon as it occurs and how they can be used to explain and fix the root cause. The second approach we’ll present is to detect drift by using the embeddings of common foundation models and use them to identify areas in the embedding space in which significant drift has occurred. These areas in embedding space should then be characterized in a human-readable way to enable root cause analysis of the detected drift. We’ll then compare the performance and explainability of these two methods, and explore the pros and cons of using each approach. Connect with us: Our website: https://www.johnsnowlabs.com/ LinkedIn: https://www.linkedin.com/company/johnsnowlabs/ Facebook: https://www.facebook.com/JohnSnowLabsInc/ Twitter: https://twitter.com/JohnSnowLabs

Download

0 formats

No download links available.

Explainable Data Drift for NLP | NatokHD