Rachel Wities

It’s Not Just the Doctor’s Handwriting: Challenges and Opportunities  in Healthcare NLP

NLP Researcher – Microsoft

Rachel Wities

It’s Not Just the Doctor’s Handwriting: Challenges and Opportunities  in Healthcare NLP

NLP Researcher – Microsoft

Bio

Rachel is an NLP researcher at Microsoft Healthcare and Life Sciences, and cofounder of Medical Data Science Israel (MeDs).

Bio

Rachel is an NLP researcher at Microsoft Healthcare and Life Sciences, and cofounder of Medical Data Science Israel (MeDs).

Abstract

During your last visit to your doctor, how much of your doctor’s attention was focused on you, and how much on her computer?

Doctors cope with the burden of processing more and more data, mostly in the form of free text.  NLP algorithms can alleviate this burden. However, NLP algorithms in the Healthcare domain have their own set of challenges: the scarcity of training due to privacy concerns, the lack of standardization, and the multilingual nature of medical data, all present a challenge for classic transformer-based NLP Models.

This lecture will discuss how those challenges were solved at Microsoft while working on Azure Text Analytics for Healthcare,  a service that provides deep analysis of multilingual medical texts. The lecture explores Federated Learning algorithms and how they were used  to solve data privacy issues, as well as the XLM-K Language Model and how it was used to train the model on seven languages with almost no training data. Finally, this lecture explains how medical ontologies (UMLS and others) can fill gaps in medical knowledge.

This talk is intended for women interested in the medical data science domain, as well as for data scientists coping with similar problems in other domains

Abstract

During your last visit to your doctor, how much of your doctor’s attention was focused on you, and how much on her computer?

Doctors cope with the burden of processing more and more data, mostly in the form of free text. NLP algorithms can alleviate this burden. However, NLP algorithms in the Healthcare domain have their own set of challenges: the scarcity of training due to privacy concerns, the lack of standardization, and the multilingual nature of medical data, all present a challenge for classic transformer-based NLP Models.

This lecture will discuss how those challenges were solved at Microsoft while working on Azure Text Analytics for Healthcare, a service that provides deep analysis of multilingual medical texts. The lecture explores Federated Learning algorithms and how they were used to solve data privacy issues, as well as the XLM-K Language Model and how it was used to train the model on seven languages with almost no training data. Finally, this lecture explains how medical ontologies (UMLS and others) can fill gaps in medical knowledge.

This talk is intended for women interested in the medical data science domain, as well as for data scientists coping with similar problems in other domains

Discussion Points

  • First, how to decide whether a labeled data is a must? 
  • Different types of labeling challenges we’ve dealt with as data scientists (partial labels, noisy labels, etc.)
  • Academic approaches that discuss possible solutions to these problems
  • Practical solutions we eventually implemented 
  • Interesting case studies and results

Discussion Points

  • First, how to decide whether a labeled data is a must? 
  • Different types of labeling challenges we’ve dealt with as data scientists (partial labels, noisy labels, etc.)
  • Academic approaches that discuss possible solutions to these problems
  • Practical solutions we eventually implemented 
  • Interesting case studies and results

Planned Agenda

8:45 Reception
9:30 Opening words by WiDS TLV ambassador Nitzan Gado and by Lily Ben Ami, CEO of the Michal Sela Forum
9:50 Prof. Bracha Shapira – Data Challenges in Recommender Systems Research: Insights from Bundle Recommendation
10:20 Juan Liu – Accounting Automation: Making Accounting Easier So That People Can Forget About It
10:50 Break
11:00 Lightning talks
12:20 Lunch & poster session
13:20 Roundtable session & poster session
14:05 Roundtable closure
14:20 Break
14:30 Merav Mofaz – “Every Breath You Take and Every Move You Make…I'll Be Watching You:” The Sensitive Side of Smartwatches
14:50 Reut Yaniv – Ad Serving in the Online Geo Space Along Routes
15:10 Rachel Wities - It’s Not Just the Doctor’s Handwriting: Challenges and Opportunities in Healthcare NLP
15:30 Closing remarks
15:40 End