Scientific Discovery Within Data Streams

Cowell, Andrew J.; Havre, Sue; May, Richard; Sanfilippo, Antonio

doi:10.1007/978-3-540-32263-4_4

Andrew J. Cowell¹⁹,
Sue Havre¹⁹,
Richard May¹⁹ &
…
Antonio Sanfilippo¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3345))

792 Accesses
2 Citations

Abstract

The term ‘data-stream’ is an increasingly overloaded expression. It often means different things to different people, depending on domain, usage or operation. Harold (2003) draws the following analogy:

“A [stream] analogy might be a queue of people waiting to get on a ride at an amusement park. As people are processed at the front (i.e. get on the roller coaster) more are added at the back of the line. If it’s a slow day the roller coaster may catch up with the end of the line and have to wait for people to board. Other days there may always be people in line until the park closes...There’s always a definite number of people in line though this number may change from moment to moment as people enter at the back of the line and exit from the front of the line. Although all the people are discrete, you’ll sometimes have a family that must be put together in the same car. Thus although the individuals are discrete, they aren’t necessarily unrelated.”

For our purposes we define a data-stream as a series of data (e.g. credit card transactions arriving at a clearing office, cellular phone traffic or environmental data from satellites) arriving in real time, that have an initiation, a continuous ingest of data, but with no expectations on the amount, length, or end of the data flow. The data stream does not have a database or repository as an intrinsic part of its definition–it is a ‘one-look’ opportunity from the perspective of data stream analytics. We call each data element in the stream a token and the complexity of these tokens ranges from simple (e.g. characters in a sentence: “T H I S I S A S T R E A M...”) to extremely complex (e.g. a detailed transaction record). The volume of data-streams is usually massive, and while each individual token may be rather uninformative, taken as a whole they describe the nature of the changing phenomena over time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lyman, P., Varian, H.: How Much Information? A project report of the Regents of the University of California (2000), available at, http://www.sims.berkeley.edu/how-much-info
Gorton, I., Almquist, J., Cramer, N., Haack, J., Hoza, M.: An Efficient, Scalable Content-Based Messaging System. In: Proc. 7th IEEE International Enterprise Distributed Object Computing Conference (EDOC 2003), Brisbane, Australia, pp. 278–285 (September 2003)
Google Scholar
Pacific Northwest National Laboratory’s IT Showcase, http://showcase.pnl.gov/show?it/triver-prod
Havre, S.L., Hetzler, B.G., Whitney, P.D., Nowell, L.T.: ThemeRiver: Visualizing Thematic Changes in Document Collections. IEEE Transactions on Visualization and Computer Graphics 8(1), 9–20 (2002)
Google Scholar
Hoffman, D.D.: Visual Intelligence: How We Create What We See. W.W. Norton and Company, Inc., New York (1998)
Google Scholar
Ware, C.: Information Visualization: Perception for Design. Academic Press, San Diego (2000)
Google Scholar
May, R.: Hi-Space: A Next Generation Workspace Environment. Masters Thesis in Electrical Engineering Computer Science. Pullman, Washington, Washington State University
Google Scholar
http://showcase.pnl.gov/show?it/hispace-prod
http://www.hitl.washington.edu/hispace.html
MacEachren, M.A., et al.: Visually-Enabled Geocollaboration to Support Data Exploration and Decision-Making. In: Proceedings of the 21st International Cartographic Conference, Durban, South Africa, pp. 10–16 (August 2003)
Google Scholar
Ullmer, B., Ishii, H.: The metaDESK: Models and Prototypes for Tangible User Interfaces. In: UIST (1997)
Google Scholar
Matsushita, N., Rekimoto, J.: HoloWall: Designing a Finger, Hand, Body, and Object Sensitive Wall. In: UIST (1997)
Google Scholar
Krueger, M.W.: Artificial Reality II. Addison-Wesley Publishing Company, Reading (1991)
Google Scholar
Wellner, P.: Interactions with Paper on the DigitalDesk. Communications of the ACM 36(7), 87–96 (1993)
Article Google Scholar
Ohshima, T., Sato, K., et al.: AR2 Hockey; A Case Study of Collaborative Augmented Reality. In: VRAIS (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Pacific Northwest National Laboratory, USA
Andrew J. Cowell, Sue Havre, Richard May & Antonio Sanfilippo

Authors

Andrew J. Cowell
View author publications
You can also search for this author in PubMed Google Scholar
Sue Havre
View author publications
You can also search for this author in PubMed Google Scholar
Richard May
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Sanfilippo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Ambient Intelligence Lab, CIC-2218, Carnegie Mellon University, 4720 Forbes Avenue, 15213, Pittsburgh, PA, USA
Yang Cai

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Cowell, A.J., Havre, S., May, R., Sanfilippo, A. (2005). Scientific Discovery Within Data Streams. In: Cai, Y. (eds) Ambient Intelligence for Scientific Discovery. Lecture Notes in Computer Science(), vol 3345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-32263-4_4

Download citation

DOI: https://doi.org/10.1007/978-3-540-32263-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24466-0
Online ISBN: 978-3-540-32263-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics