A decoder-only foundation model for time-series forecasting

AI-generated keywords: Large Language Models

AI-generated Key Points

  • Introduction of TimesFM as a time-series foundation model for forecasting
  • Impressive zero-shot performance on various public datasets, rivaling state-of-the-art supervised forecasting models
  • Core of the model is pretraining a patched-decoder style attention architecture on a vast time-series corpus
  • Data used for pretraining includes Google Trends, Wiki Pageviews, and synthetic time-series
  • Pretraining process involves around 100 billion time-points overall using a patched-decoder style attention architecture with approximately 200 million parameters
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Abhimanyu Das, Weihao Kong, Rajat Sen, Yichen Zhou

License: CC BY 4.0

Abstract: Motivated by recent advances in large language models for Natural Language Processing (NLP), we design a time-series foundation model for forecasting whose out-of-the-box zero-shot performance on a variety of public datasets comes close to the accuracy of state-of-the-art supervised forecasting models for each individual dataset. Our model is based on pretraining a patched-decoder style attention model on a large time-series corpus, and can work well across different forecasting history lengths, prediction lengths and temporal granularities.

Submitted to arXiv on 14 Oct. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.10688v3

, , , , Motivated by recent advancements in large language models for Natural Language Processing (NLP), we introduce TimesFM, a time-series foundation model for forecasting. Our model showcases impressive zero-shot performance on various public datasets, rivaling the accuracy of state-of-the-art supervised forecasting models tailored to each dataset. The core of our model lies in pretraining a patched-decoder style attention architecture on a vast time-series corpus that encompasses real-world and synthetic data. To ensure our pretraining corpus captures the diverse forecasting use-cases we aim to address, we draw data from three primary sources: Google Trends, Wiki Pageviews, and synthetic time-series. Google Trends provides search interest trends for approximately 22k head queries over 15 years, offering hourly, daily, weekly, and monthly granularities totaling around 1 billion time-points. Wiki Pageviews offers hourly views of all Wikimedia pages from Jan. 2012 to Nov. 2023, amounting to roughly 100 billion time-points after cleaning and aggregation. In addition to these real-world sources, we generate synthetic data representing ARMA processes, seasonal patterns, trends with change-points, and step functions. This synthetic data comprises 3 million time-series of length 2048 time-points each. By combining these diverse datasets in our pretraining process involving around 100 billion time-points overall using a patched-decoder style attention architecture with approximately 200 million parameters. Looking ahead, we aim to delve deeper into understanding how our foundation model performs well on out-of-distribution data and explore its fine-tuning/few-shot capabilities. Overall, this work contributes significantly to advancing the field of Time-Series Forecasting using Machine Learning techniques with potential societal implications that warrant further exploration.
Created on 15 Mar. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.