DIMACS Workshop on Data Quality, Data Cleaning and

Monday, November 3, 2003 - Tuesday, November 4, 2003
DIMACS (Center for Discrete Mathematics and Theoretical Computer Science) Piscataway, NJ
The word "data" has taken on a broad meaning in the last five
years. It no longer means only a set of numbers or even text. New data
paradigms include data streams characterized by a high rate of
accumulation, web-scraped documents and tables, web server logs,
images, audio and video, to name a few. Well-known challenges of
heterogeneity and scale continue to grow as data are integrated from
disparate sources and become larger and more complex in content.