News & features
SIBYL: A machine learning-based framework for forecasting dynamic workloads
| Rana Alotaibi, Hanxian Huang, Tarique Siddiqui, Carlo Curino, Jesús Camacho Rodríguez, and Yuanyuan Tian
SIBYL is a machine learning model that makes highly accurate predictions of database queries, enabling tuning for more efficiency. Applying traditional database optimizations to these predicted queries helps maintain high performance as demands change.
LST-Bench: A new benchmark tool for open table formats in the data lake
| Jesús Camacho Rodríguez, Ashvin Agrawal, Anja Gruenheid, Ashit Gosalia, Cristian Petculescu, Josep Aguilar-Saborit, Avrilia Floratou, Carlo Curino, and Raghu Ramakrishnan
LST-Bench is a new open-source benchmark designed to evaluate table formats in cloud environments. It extends existing benchmarks to better reflect real-world usage & performance of data lakes and easily integrates with commonly used analytical engines.
Enhanced autoscaling with VASIM: Vertical Autoscaling Simulator Toolkit
| Anna Pavlenko, Karla Saur, Yiwen Zhu, Brian Kroth, Joyce Cahoon, and Jesús Camacho Rodríguez
Autoscaling can optimize cloud resource usage and costs by adjusting to demand. VASIM shows that simplifying testing and refinement of autoscaling algorithms can enable rapid development and evaluation of more efficient & cost-effective autoscaling strategies.
Research Focus: Week of February 19, 2024
In this issue: CaaSPER: vertical autoscaling algorithm dynamically maintains optimal CPU utilization; Improved scene landmark detection for camera localization runs faster, uses less storage; ESUS simplifies usability questionnaires for technical products and services.
Research Focus: Week of September 11, 2023
In this issue: Efficient polyglot analytics on semantic data aids query performance; generative retrieval for conversational question answering improves dialogue-based interfaces; a new tool uses ML to address capacity degradation in lithium-ion batteries.
Awards | Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem
Best Demonstration Award, VLDB 2022
We are announcing Hummingbird, a library for accelerating inference (scoring/prediction) in traditional machine learning models. Internally, Hummingbird compiles traditional ML pipelines into tensor computations to take advantage of the optimizations that are being implemented for neural network systems.
In the news | InfoWorld
Understanding Azure Arc
One of the more interesting announcements at Microsoft’s 2019 Ignite conference was Azure Arc, a new management tool for hybrid cloud application infrastructures. Building on Azure concepts, Arc is designed to allow you to manage on-premises resources from the Azure…
Applied Research; Insightful Impact
We take an ‘applied research’ approach. Our scientists collaborate with our engineering and product teams to solve hard technical problems. Coupled with a direct line to data, resources, engineers, and the voice of the customer, GSL retains the…