Skip to main content

StreamFitter: A Real Time Linear Regression Analysis System for Continuous Data Streams

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6588))

Included in the following conference series:

Abstract

In this demo, we present the StreamFitter system for real-time linear regression analysis on continuous data streams. In order to perform regression on data streams, it is necessary to continuously update the regression model while receiving new data. In this demo, we will present two approaches for on-line, multi-dimensional linear regression analysis of stream data, namely Incremental Mathematical Stream Regression (IMSR) and Approximate Stream Regression (ASR). These methods dynamically recompute the regression model, considering not only the data records of the current window, but also the synopsis of the previous data. Therefore, the refined parameters more accurately model the entire data stream. The demo will show that the proposed methods are not only efficient in time and space, but also generate better fitted regression functions compared to the traditional sliding window methods and well-adapted to data changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and Issues in Data Stream Systems. In: PODS (March 2002)

    Google Scholar 

  2. Brown, R.G., Meyer, R.F.: The fundamental theorem of exponential smoothing. Operations Research 9(5), 673–685 (1961)

    Article  MathSciNet  MATH  Google Scholar 

  3. Keogh, E., Chu, S., Hart, D., Pazzani, M.: Segmenting Time Series: A Survey And Novel Approach. In: Data Mining in Time Series Databases. World Scientific Publishing Company, Singapore (2004)

    Google Scholar 

  4. Berk, R.A.: Regression Analysis: A Constructive Critique. Sage Publications, Thousand Oaks (2004)

    Book  Google Scholar 

  5. Tokyo Stock Exchange, http://www.tse.or.jp/english/market/topix/data/index.html

  6. Tropical Atmosphere Ocean  project, http://www.pmel.noaa.gov/tao/index.shtml

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nadungodage, C.H., Xia, Y., Li, F., Lee, J.J., Ge, J. (2011). StreamFitter: A Real Time Linear Regression Analysis System for Continuous Data Streams. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20152-3_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20152-3_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20151-6

  • Online ISBN: 978-3-642-20152-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics