About
I am a Principal Research SDE at the Data Systems (opens in new tab) group of Microsoft Research. I received my Ph.D. and M.Sc. from the University of Massachusetts Amherst under the supervision of Prof. Gerome Miklau (opens in new tab) and Prof. Alexandra Meliou (opens in new tab). I got my B.Sc. in Computer Science from Fudan University. I am interested in data mining and database management systems.
I have been working on Data Integration and Index Tuning. E.g., Recently we developed Index Tuning in Azure PostgreSQL (opens in new tab). Earlier, I developed a novel scale-out deduplication (opens in new tab) library and delivered it in Dynamics 365 Customer Insights (opens in new tab). I optimized and shipped Fuzzy Join in Power BI Desktop (opens in new tab), Power Query in Excel (opens in new tab), Customer Insights (opens in new tab), and Azure Data Factory (opens in new tab), and Fuzzy Group By in Power Query Online (opens in new tab). I optimized and delivered Add Column from Examples in Power BI Desktop (opens in new tab) and Power Query in Excel (opens in new tab). These are the outcome of collaborations with my teammates and managers such as Yeye He, Wentao Wu, Gaoxiang Xu, Manoj Syamala, Kukjin Lee, Vivek Narasayya, Surajit Chaudhuri, and numerous excellent engineers, product managers, and scientists from Azure PostgreSQL, Customer Insights, Power Query, Azure Data Factory, etc.
Service Activities:
Program Committee Member of KDD (ADS) ’23 (Excellence of Reviewing), ’22; WSDM ’23, ’22; ICDE (Demo) ’23; CIKM ’23, ’22, ’21, ’20; ECML-PKDD ’23, ’22, ’21, ’20; etc.
External Reviewer of VLDB ’22, ’18; ICDE ’22, ’20; KDD ’20, ’19; AAAI ’19; WSDM ’19; CIKM ’19, ’18; PODS ’17; etc.
Glad to review papers on data integration, data exploration, data mining, and database!