Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3345))

Abstract

The term ‘data-stream’ is an increasingly overloaded expression. It often means different things to different people, depending on domain, usage or operation. Harold (2003) draws the following analogy:

“A [stream] analogy might be a queue of people waiting to get on a ride at an amusement park. As people are processed at the front (i.e. get on the roller coaster) more are added at the back of the line. If it’s a slow day the roller coaster may catch up with the end of the line and have to wait for people to board. Other days there may always be people in line until the park closes...There’s always a definite number of people in line though this number may change from moment to moment as people enter at the back of the line and exit from the front of the line. Although all the people are discrete, you’ll sometimes have a family that must be put together in the same car. Thus although the individuals are discrete, they aren’t necessarily unrelated.”

For our purposes we define a data-stream as a series of data (e.g. credit card transactions arriving at a clearing office, cellular phone traffic or environmental data from satellites) arriving in real time, that have an initiation, a continuous ingest of data, but with no expectations on the amount, length, or end of the data flow. The data stream does not have a database or repository as an intrinsic part of its definition–it is a ‘one-look’ opportunity from the perspective of data stream analytics. We call each data element in the stream a token and the complexity of these tokens ranges from simple (e.g. characters in a sentence: “T H I S I S A S T R E A M...”) to extremely complex (e.g. a detailed transaction record). The volume of data-streams is usually massive, and while each individual token may be rather uninformative, taken as a whole they describe the nature of the changing phenomena over time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lyman, P., Varian, H.: How Much Information? A project report of the Regents of the University of California (2000), available at, http://www.sims.berkeley.edu/how-much-info

  2. Gorton, I., Almquist, J., Cramer, N., Haack, J., Hoza, M.: An Efficient, Scalable Content-Based Messaging System. In: Proc. 7th IEEE International Enterprise Distributed Object Computing Conference (EDOC 2003), Brisbane, Australia, pp. 278–285 (September 2003)

    Google Scholar 

  3. Pacific Northwest National Laboratory’s IT Showcase, http://showcase.pnl.gov/show?it/triver-prod

  4. Havre, S.L., Hetzler, B.G., Whitney, P.D., Nowell, L.T.: ThemeRiver: Visualizing Thematic Changes in Document Collections. IEEE Transactions on Visualization and Computer Graphics 8(1), 9–20 (2002)

    Google Scholar 

  5. Hoffman, D.D.: Visual Intelligence: How We Create What We See. W.W. Norton and Company, Inc., New York (1998)

    Google Scholar 

  6. Ware, C.: Information Visualization: Perception for Design. Academic Press, San Diego (2000)

    Google Scholar 

  7. May, R.: Hi-Space: A Next Generation Workspace Environment. Masters Thesis in Electrical Engineering Computer Science. Pullman, Washington, Washington State University

    Google Scholar 

  8. http://showcase.pnl.gov/show?it/hispace-prod

  9. http://www.hitl.washington.edu/hispace.html

  10. MacEachren, M.A., et al.: Visually-Enabled Geocollaboration to Support Data Exploration and Decision-Making. In: Proceedings of the 21st International Cartographic Conference, Durban, South Africa, pp. 10–16 (August 2003)

    Google Scholar 

  11. Ullmer, B., Ishii, H.: The metaDESK: Models and Prototypes for Tangible User Interfaces. In: UIST (1997)

    Google Scholar 

  12. Matsushita, N., Rekimoto, J.: HoloWall: Designing a Finger, Hand, Body, and Object Sensitive Wall. In: UIST (1997)

    Google Scholar 

  13. Krueger, M.W.: Artificial Reality II. Addison-Wesley Publishing Company, Reading (1991)

    Google Scholar 

  14. Wellner, P.: Interactions with Paper on the DigitalDesk. Communications of the ACM 36(7), 87–96 (1993)

    Article  Google Scholar 

  15. Ohshima, T., Sato, K., et al.: AR2 Hockey; A Case Study of Collaborative Augmented Reality. In: VRAIS (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Cowell, A.J., Havre, S., May, R., Sanfilippo, A. (2005). Scientific Discovery Within Data Streams. In: Cai, Y. (eds) Ambient Intelligence for Scientific Discovery. Lecture Notes in Computer Science(), vol 3345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-32263-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-32263-4_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24466-0

  • Online ISBN: 978-3-540-32263-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics