Reduced data collection and annotation costs<\/li>\n<\/ul>\n\n\n\nTime series prototypes: Building blocks of cross-domain generalization<\/h2>\n\n\n\n TimeDP\u2019s innovation lies in its use of time-series prototypes\u2014modular, reusable patterns that capture the fundamental characteristics of time-series data, like trends, fluctuations, or periodicity. These prototypes act like vocabulary words in a language model, representing the core \u201cstyle\u201d elements of various domains. This structural similarity between language components and time-series prototypes is illustrated in Figure 1.<\/p>\n\n\n\nFigure 1. Time-series prototypes form domain prompts that describe time-series styles (right), similar to how prompts guide outputs in language models (left).<\/figcaption><\/figure>\n\n\n\nTimeDP leverages these building blocks to create domain prompts, enabling it to generate data tailored to new domains without requiring labeled training data. The model architecture, shown in Figure 2, includes three core components:<\/p>\n\n\n\n
\nTime-series prototypes<\/strong>: Capture core elements like seasonal trends or volatility, allowing the model to flexibly combine patterns to synthesize domain-specific data.<\/li>\n\n\n\nPAM<\/strong>: Assigns relevant prototypes to input samples, helping the model adapt to new domains during the training and generation phases.<\/li>\n\n\n\nCross-domain prompts<\/strong>: Derived automatically from a few examples, these prompts guide generation without the need for manually provided labels.<\/li>\n<\/ul>\n\n\n\nFigure 2. The TimeDP model framework<\/figcaption><\/figure>\n\n\n\nEvaluating TimeDP: Validating consistency across multiple domains<\/h2>\n\n\n\n To test the model’s effectiveness, researchers evaluated TimeDP on 12 real-world datasets spanning four domains: energy, transportation, weather, and finance. Using evaluation metrics including Maximum Mean Discrepancy (MMD) and Kullback-Leibler (KL) divergence which measure the similarity between synthetic and real data, the team compared TimeDP\u2019s output to both real-world data and the outputs of other state-of-the-art models.<\/p>\n\n\n\n
The results, shown in Figure 2, are impressive. In intra-domain scenarios\u2014where training and test data come from the same domain\u2014TimeDP reduced MMD by an average of 25.9% and KL divergence by 53.0%, indicating a strong alignment between generated and real data, and significantly outperforming baseline models.<\/p>\n\n\n\n Table 1. Intra-domain generation results<\/figcaption><\/figure>\n\n\n\nTimeDP also excels in unseen domains\u2014those to which the model had no prior exposure during training. With just a few samples\u2014and without any fine-tuning\u2014TimeDP generated data that closely mirrored the statistical properties of real datasets. It outperformed fine-tuned baseline models, demonstrating robust generalization capabilities. The results are shown in Table 2.<\/p>\n\n\n\nTable 2. Generation results for unseen domains<\/figcaption><\/figure>\n\n\n\nTimeDP and the future of synthetic time-series data<\/h2>\n\n\n\n As demand grows for high-quality time-series data across industries, synthetic data offers a practical solution to challenges like privacy protection and data scarcity. By generating artificial data that preserves statistical patterns, TimeDP protects sensitive information, especially in fields like healthcare. Its ability to learn from a small number of unlabeled examples reduces reliance on large, labeled datasets that are often costly or difficult to obtain, making it particularly valuable in low-resource or privacy-sensitive settings.<\/p>\n\n\n\n
Future research will focus on expanding TimeDP\u2019s capabilities by incorporating domain knowledge, responding to user input through natural language, and adapting to shifting data environments. As part of a broader move toward more general-purpose synthetic time-series generation tools, TimeDP marks a promising step in supporting AI development across diverse and dynamic domains.<\/p>\n","protected":false},"excerpt":{"rendered":"
Time-series data\u2014measurements collected over time like stock prices or heart rates\u2014plays a vital role in AI forecasting systems across industries. As these systems advance, the need for time-series data is increasing, especially synthetic data, which offers numerous advantages over real-world data. In healthcare, synthetic data protects patient privacy; in finance, it enables risk-free testing of […]<\/p>\n","protected":false},"author":34512,"featured_media":1135180,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":199560,"msr_hide_image_in_river":null,"footnotes":""},"research-area":[13556],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-1139137","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-locale-en_us"],"msr_assoc_parent":{"id":199560,"type":"lab"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1139137","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/34512"}],"version-history":[{"count":6,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1139137\/revisions"}],"predecessor-version":[{"id":1141761,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1139137\/revisions\/1141761"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1135180"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1139137"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1139137"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1139137"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1139137"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}