Session: PIIGuardRails on LLMs - Protecting Personally Identifiable Information
PII (Personally Identifiable Information ) relates to any data that can be used to identify a specific person. In LLM models, there is a risk that PII sensitive information may get leaked. Non-sensitive personal information can become a risk when that data is pieced together to identify a person indirectly. For instance, gender, birthday, ethnicity, or medical information may disclose a person’s identity when combined with other information.
This talk provides a novel ML-based engine to identify Personally Identifying Information for vast unstructured documents. It is the state-of-the-art engine which provides comprehensive PII coverage across multiple detectors, thus providing guardrails to LLM outputs.
Bio
Shubhi Asthana is a Sr Research Software Engineer who build AI & ML Solutions. She is SME in AI and ML models for Financial Services, along with leading the PII effort in Unstructured Data & NLP. Her research and development work spans the areas of Data Analytics, NLP and Cloud Services.