- Author
- Wolfgang Lehner Technische Universität Dresden, Fakultät Informatik, Institut für Systemarchitektur, Professur für Datenbanken
- Philipp Rösch Technische Universität Dresden, Fakultät Informatik, Institut für Systemarchitektur, Professur für Datenbanken
- Title
- Sample synopses for approximate answering of group-by queries
- Citable URL:
- https://nbn-resolving.org/urn:nbn:de:bsz:14-qucosa2-767310
- Conference
- EDBT '09: 12th International Conference on Extending Database Technology: Advances in Database Technology. Saint Petersburg, Russia, 24-26 March 2009
- Source
- Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology (EDBT '09)
Place of publication: New York, NY
Publisher: ACM
Year of publication: 2009
Pages: 403-414
ISBN: 978-1-60558-422-5
- First published
- 2009
- Abstract (EN)
- With the amount of data in current data warehouse databases growing steadily, random sampling is continuously gaining in importance. In particular, interactive analyses of large datasets can greatly benefit from the significantly shorter response times of approximate query processing. Typically, those analytical queries partition the data into groups and aggregate the values within the groups. Further, with the commonly used roll-up and drill-down operations, a broad range of group-by queries is posed to the system, which makes the construction of highly specialized synopses difficult. In this paper, we propose a general-purpose sampling scheme that is biased in order to answer group-by queries with high accuracy. While existing techniques focus on the size of the group when computing its sample size, our technique is based on its standard deviation. The basic idea is that the more homogeneous a group is, the fewer representatives are required in order to give a good estimate. With an extensive set of experiments, we show that our approach reduces both the estimation error and the construction cost compared to existing techniques.
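The basic idea stated in the abstract, that homogeneous groups need fewer representatives, can be illustrated with a minimal Python sketch. This is not the paper's algorithm: the allocation rule (one guaranteed row per group, the rest of the budget split in proportion to each group's standard deviation) and all names (`allocate_sample_sizes`, `build_sample`, `estimate_avg`, `budget`) are assumptions made purely for illustration.

```python
import random
import statistics


def allocate_sample_sizes(groups, budget, min_per_group=1):
    """Illustrative rule (an assumption, not the paper's method):
    every group gets `min_per_group` rows, and the remaining budget is
    split in proportion to each group's population standard deviation."""
    spreads = {g: statistics.pstdev(vals) for g, vals in groups.items()}
    total_spread = sum(spreads.values())
    remaining = max(0, budget - min_per_group * len(groups))
    sizes = {}
    for g, vals in groups.items():
        # If all groups are perfectly homogeneous, fall back to equal shares.
        share = spreads[g] / total_spread if total_spread > 0 else 1 / len(groups)
        # Rounding means the sizes may not sum exactly to `budget` in general.
        n = min_per_group + round(remaining * share)
        sizes[g] = min(n, len(vals))  # never request more rows than exist
    return sizes


def build_sample(groups, sizes, seed=0):
    """Draw a uniform random sample without replacement inside each group."""
    rng = random.Random(seed)
    return {g: rng.sample(vals, sizes[g]) for g, vals in groups.items()}


def estimate_avg(sample):
    """Approximate answer to: SELECT g, AVG(x) FROM t GROUP BY g."""
    return {g: statistics.mean(vals) for g, vals in sample.items()}


# Hypothetical data: one homogeneous and one heterogeneous group.
groups = {
    "homogeneous": [10, 10, 10, 10, 10, 10],
    "heterogeneous": [1, 50, 100, 7, 62, 33],
}
sizes = allocate_sample_sizes(groups, budget=6)
sample = build_sample(groups, sizes)
estimates = estimate_avg(sample)
print(sizes)
print(estimates)
```

In this toy input both groups have the same size, so a purely size-based allocation would give each group the same number of sample rows. The spread-based rule instead spends almost the whole budget on the heterogeneous group, while a single row already answers the average of the homogeneous group exactly.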
- Other edition
- Link to the article as first published in the ACM Digital Library.
DOI: 10.1145/1516360.1516408
- Free keywords (DE)
- Datenbankverwaltungssystem, Verarbeitung von Datenbankabfragen, Datenbanktheorie, Algorithmen für Anwendungsdomänen
- Free keywords (EN)
- database management system, database query processing, database theory, algorithms for application domains
- Classification (DDC)
- 004
- Publisher
- ACM, New York
- Version / Review status
- accepted version / postprint / author's version
- URN Qucosa
- urn:nbn:de:bsz:14-qucosa2-767310
- Publication date (Qucosa)
- 22 April 2022
- Document type
- Conference paper
- Language of the document
- English
- License / Rights notice