{"id":714646,"date":"2020-12-30T02:54:35","date_gmt":"2020-12-30T10:54:35","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=714646"},"modified":"2021-10-13T21:15:01","modified_gmt":"2021-10-14T04:15:01","slug":"vert-versatile-entity-recognition-disambiguation-toolkit","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/vert-versatile-entity-recognition-disambiguation-toolkit\/","title":{"rendered":"VERT: Versatile Entity Recognition & Disambiguation Toolkit"},"content":{"rendered":"
While knowledge about entities is a key building block in the mentioned systems, creating effective\/efficient models for real-world scenarios remains a challenge (tech\/data\/real workloads).<\/p>\n
Based on such needs, we’ve created VERT<\/strong> – a V<\/strong>ersatile E<\/strong>ntity R<\/strong>ecognition & Disambiguation T<\/strong>oolkit. VERT<\/strong> is a pragmatic toolkit that combines rules and ML, offering both powerful pretrained models for core entity types (recognition and linking) and the easy creation of custom models. Custom models use our deep learning-based NER\/EL models that minimizes the needs for handcrafted features to quickly create deployeable models with state-of-the-art quality and performance, as well as facilitate refining pre-trained models.<\/p>\n VERT<\/strong> emphasizes requirements from real world use cases and it’s composed of 4 main modules:<\/p>\n\n
\nThis module is publicly available as a deterministic rule-based system designed for extensibility (Recognizers-Text<\/strong> (opens in new tab)<\/span><\/a>) supporting multiple languages. Moreover, vNext is on the way.<\/li>\n
\nThis module includes pre-trained models for typical entity types and available in 5 languages, as well as tools to train or refine new models.<\/li>\n