Abstract
The technologies of mobile communications pervade our society and wireless networks sense the movement of people, generating large volumes of mobility data, such as mobile phone call records and Global Positioning System (GPS) tracks. In this work, we illustrate the striking analytical power of massive collections of trajectory data in unveiling the complexity of human mobility. We present the results of a large-scale experiment, based on the detailed trajectories of tens of thousands private cars with on-board GPS receivers, tracked during weeks of ordinary mobile activity. We illustrate the knowledge discovery process that, based on these data, addresses some fundamental questions of mobility analysts: what are the frequent patterns of people’s travels? How big attractors and extraordinary events influence mobility? How to predict areas of dense traffic in the near future? How to characterize traffic jams and congestions? We also describe M-Atlas, the querying and mining language and system that makes this analytical process possible, providing the mechanisms to master the complexity of transforming raw GPS tracks into mobility knowledge. M-Atlas is centered onto the concept of a trajectory, and the mobility knowledge discovery process can be specified by M-Atlas queries that realize data transformations, data-driven estimation of the parameters of the mining methods, the quality assessment of the obtained results, the quantitative and visual exploration of the discovered behavioral patterns and models, the composition of mined patterns, models and data with further analyses and mining, and the incremental mining strategies to address scalability.
Similar content being viewed by others
References
Abul, O., Bonchi, F., Nanni, M.: \({\mathcal{N}}\) ever \({\mathcal{W}}\) alk \({\mathcal{A}}\) lone: Uncertainty for anonymity in moving objects databases. In: Proceedings of the 24nd IEEE International Conference on Data Engineering (ICDE’08) 2008
Abul O., Bonchi F., Nanni M.: Anonymization of moving objects databases by clustering and perturbation. Inf. Syst. 35(8), 884–910 (2010)
Agenzia Milanese Mobilità e Ambiente. Indagine sulla mobilità delle persone dell’area milanese (2006)
Andrienko G., Andrienko N., Wrobel S.: Visual analytics tools for analysis of movement data. SIGKDD Explor. Newslett. 9(2), 38–46 (2007)
Andrienko, G., Andrienko, N., Rinzivillo, S., Nanni, M., Pedreschi, D., Giannotti, F.: Interactive visual clustering of large collections of trajectories. In: IEEE Visual Analytics Science and Technology (VAST 2009) 3–10 (2009)
Ankerst M., Breunig M.M., Kriegel H.P., Sander J.: Optics: ordering points to identify the clustering structure. SIGMOD 49–60 (1999)
Baglioni, M., de Macedo, J., Renso, C., Trasarti, R, Wachowicz, M. Towards semantic intepretation of movement data. In: AGILE Conference (2009)
Benkert M., Gudmundsson J., Hübner F., Wolle T.: Reporting flock patterns. Comput. Geom. Theory Appl. 41(3), 111–125 (2008)
Brockmann D., Hufnagel L., Geisel T.: The scaling laws of human travel. Nature 439, 462 (2006)
Chen, Z., Heng Tao, S., Zhou, X., Zheng, Y., Xie, X.: Searching trajectories by locations: an efficiency study. In: Proceedings of the 2010 International Conference on Management of data, SIGMOD ’10, pp. 255–266
Cudré-Mauroux, P., Wu, E., Madden, S.T.: An adaptive storage system for very large trajectory data sets. In: International Conference on Data Engineering, pp. 109–120 (2010)
De Raedt, L., Jaeger M., Lee, S.D., Mannila, H.: A theory of inductive query answering. In: IEEE International Conference on Data Mining (2002)
Ester, M., Kriegel, H-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings on the Knowledge Discovery in Databases Conference, pp. 226–231 (1996)
Gaffney, S., Smyth, P.: Trajectory clustering with mixture of regression models. In: Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, pp. 63–72. ACM (1999)
GeoLife Web Site http://research.microsoft.com/en-us/projects/geolife
GeoPKDD website. http://www.geopkdd.eu
Giannotti, F., Nanni, M., Pinelli, F., Pedreschi, D.: Trajectory pattern mining. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining, pp. 330–339 (2007)
Giannotti, F., Pedreschi, D. (Eds.) Mobility, Data Mining and Privacy—Geographic Knowledge Discovery. Springer, Berlin (2008)
Gonzalez M., Hidalgo C.A., Barabási A.-L.: Understanding individual human mobility patterns. Nature 453, 779–782 (2008)
Gudmundsson, J., van Kreveld, M.: Computing Longest Duration Flocks in Trajectory Data. In: 14th Annual ACM International Symposium on Advances in Geographic Information Systems, pp. 35–42. New York: ACM
Güting R.H., Böhlen M.H., Erwig M., Jensen C.S., Lorentzos N.A., Schneider M., Vazirgiannis M.: A foundation for representing and querying moving objects. ACM Trans. Database Syst. 25(1), 1–42 (2000)
Imielinski T., Mannila H.: A database perspective on knowledge discovery. Commun. ACM 39(11), 58–64 (1996)
Johnson, T., Lakshmanan, L.V.S., Ng, R.T.: The 3W model and algebra for unified data mining. In: VLDB ’00: Proceedings of the 26th International Conference on Very Large Data Bases, pp. 21–32. Morgan Kaufmann, San Francisco (2000)
Kalnis, P., Mamoulis, N., Bakiras, S.: On discovering moving clusters in spatio-temporal data. In Proceedings of 9th International Symposium on Spatial and Temporal Databases (SSTD’05), pp. 364–381. Springer, Berlin (2005)
Lee, J.-G., Han, J., Whang, K.-Y.: Trajectory clustering: a partition-and-group framework. In: SIGMOD Conference, pp. 593–604 (2007)
Li, Z., Ji, M., Lee, J.-G., Tang, L.A., Yu, Y., Han, J., Kays, R.: Movemine: mining moving object databases. In: SIGMOD Conference, pp. 1203–1206 (2010)
Monreale, A., Pinelli, F., Trasarti, R, Giannotti, F.: Wherenext: a location predictor on trajectory pattern mining. In: 15th ACM SIGKDD Conference on Knoledge Discovery and Data Mining (KDD’09) (2009)
Monreale A., Andrienko G., Andrienko N., Giannotti F., Pedreschi D., Rinzivillo S., Wrobel S.: Movement data anonymity through generalization. Trans. Data Privacy 3(2), 91–121 (2010)
Nanni M., Pedreschi D.: Time-focused clustering of trajectories of moving objects. J. Intell. Inf. Syst. 27(3), 267–289 (2006)
Nanni, M., Trasarti, R., Renso, C., Giannotti, F., Pedreschi, D.: Advanced knowledge discovery on movement data with the GeoPKDD system. In: Proceedings of the 13th International Conference on Extending Database Technology, pp. 693–696 (2010)
Octotelematics http://www.octotelematics.it/
Olteanu, A.-M., Trasarti, R., Couronn, T., Giannotti, F., Nanni, M., Smoreda, Z., Ziemlicki, C.: GSM data analysis for tourism application. In: Proceedings of the 7th International Symposium on Spatial Data Quality (ISSDQ) (2011)
Ortale, R., Ritacco, E., Pelekis, N., Trasarti, R., Costa, G., Giannotti, F., Manco., Renso, C., Theodoridis, Y.: The DAEDALUS framework: progressive querying and mining of movement data. In: 16th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, ACM-GIS, p. 52 (2008)
Pelekis, N., Theodoridis, Y., Vosinakis, S., Panayiotopoulos, T.: Hermes: a framework for location-based data management. In: Proceedings of the International Conference on Extending Database Technology, pp. 1130–1134 (2006)
Quannan, L., Zheng, Y., Xing, X., Yukun, C., Wenyu, L., Wei-Ying, M.: Mining user similarity based on location history. In: Proceedings of the 16th ACM SIGSPATIAL International conference on Advances in Geographic Information Systems, GIS ’08, vol. 34, pp. 1–34:10 (2008)
Song C., Koren T., Wang P., Barabási A.-L.: Modelling the scaling properties of human mobility. Nat. Phys. 7, 713 (2010)
Song C., Qu Z., Blumm N., Barabási A.-L.: Limits of predictability in human mobility. Science 327, 1018–1021 (2010)
The PostGIS spatial database http://postgis.refractions.net/
Trasarti, R.: Mastering the Spatio-Temporal Knowledge Discovery Process. PhD in Computer science, University of Pisa (2010)
Trasarti R., Giannotti F., Nanni M., Pedreschi D., Renso C.: A query language for mobility data mining. Int. J. Data Warehousing Mining (IJDWM) 7(1), 24–45 (2011)
Trasarti, R., Pinelli, F., Nanni, M., Giannotti, F.: Mining mobility user profiles for car pooling. In: 17th ACM SIGKDD Conference on Knoledge Discovery and Data Mining (KDD’11) (2011)
Wachowicz, M., Ong, R., Renso, C., Nanni, M.: Discovering moving flock patterns among pedestrians through spatio-temporal coherence. Int. J. Geograph. Inf. Sci. (in press)
Wang, D., Pedreschi, D., Song, C., Giannotti, F., Barabási, A.-L.: Human mobility, social ties and link prediction. In: 17th ACM SIGKDD Conference on Knoledge Discovery and Data Mining (KDD’11) (2011)
Wang, H., Zaniolo C., Atlas, L.C.: A small but complete sql extension for data mining and data streams. In: Proceedings of International Conference of Very Large Data Base, pp. 1113–1116 (2003)
Wang P., Gonzalez M., Hidalgo C.A., Barabási A.-L.: Understanding the spreading patterns of mobile phone viruses. Science 324, 1071–1076 (2009)
Zheng, Y., Zhang, L., Xie, X., M, W.-Y.: Mining interesting locations and travel sequences from GPS trajectories. In: Proceedings of the 18th International Conference on World Wide Web, WWW ’09, pp. 791–800
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Giannotti, F., Nanni, M., Pedreschi, D. et al. Unveiling the complexity of human mobility by querying and mining massive trajectory data. The VLDB Journal 20, 695–719 (2011). https://doi.org/10.1007/s00778-011-0244-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00778-011-0244-8