About
I am a researcher in the Data Systems group at Microsoft Research. I finished my PhD at University of Wisconsin-Madison with Prof. Jeffrey Naughton.
Recently I have been working on Self-service Data Preparation (opens in new tab), where we develop technologies to automate a variety of data-preparation tasks in the context of data science and business intelligence workflows.
Our research has been recognized with best paper awards at VLDB and SIGMOD. Additionally, some of our technologies have been integrated into various Microsoft products and services, including Power Query (opens in new tab) for Power BI (opens in new tab) (program synthesis, operator recommendations), Excel (opens in new tab) (data cleansing, error detection in tables), Azure Machine Learning (opens in new tab) (data prep SDK), and Azure Purview (opens in new tab) (auto-tagging of data columns, data-quality suggestion for tables in data lakes).
Previously I worked on search engine query-log mining (Entity-Synonym (opens in new tab), Attribute-Synonym (opens in new tab), Acronym (opens in new tab), etc.), which are used in applications like Bing Snapp (opens in new tab) and Bing Knowledge Widget (opens in new tab).