Project Title: III: Small: Indexing, Querying, and Visualizing Big Spatial and Spatio-temporal Data


Project Award Number: IIS-1525953

PI Name: Mohamed Mokbel
Department: Computer Science and Engineering
Institution: University of Minnesota
Address: 200 Union ST SE, Minneapolis, MN, 55455, USA
Email: mokbel@cs.umn.edu
URL: www.cs.umn.edu/~mokbel


   PhD Alumni:
  • Ahmed Eldawy (PhD., 2016, Assistant Professor at University of California Riverside)
    • Thesis Title: SpatialHadoop: A Map-Reduce framework for Big Spatial Data
  • Amr Magdy (PhD., 2017, Assistant Professor at University of California Riverside)
    • Thesis Title: Kite: A Scalable Microblogs Data Management System
   PhD Students:

   Keywords:
  • Big spatial data
  • Big spatio-temporal data
  • Hadoop
  • Visualization

   Released Software:

SpatialHadoop is an open-source MapReduce framework with built-in support for spatial data. It uses the MapReduce programming paradigm for distributed processing to provide a general-purpose tool for large-scale analysis of spatial data on large clusters. Users interact with SpatialHadoop through a high-level language with built-in support for spatial data types and spatial operations. Existing spatial data sets can be loaded into SpatialHadoop using the built-in spatial data types: point, polygon, and rectangle. SpatialHadoop is also extensible, and users can add more data types. In addition, data sets are stored efficiently using built-in indexes (grid file or R-tree), which speed up retrieval and processing. Users can build an index of their choice with a single command that runs in parallel on the machines in the cluster. Once the index is built, users can analyze their data sets using the built-in spatial operations (range query, k-nearest-neighbor, and spatial join). The extensibility of SpatialHadoop also allows users to implement more spatial operations as MapReduce programs. For more information, please visit: "http://spatialhadoop.cs.umn.edu/"
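
To make the idea of a grid index and a range query concrete, here is a minimal single-machine sketch in Python. It is not SpatialHadoop code and does not use its API; the function names `build_grid_index` and `range_query`, the `cell_size` parameter, and the tuple-based point and rectangle types are all assumptions chosen for illustration. It only shows why an index helps: the query visits the grid cells that overlap the query rectangle rather than scanning every point.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Point = Tuple[float, float]
Rect = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)
Cell = Tuple[int, int]


def build_grid_index(points: List[Point], cell_size: float) -> Dict[Cell, List[Point]]:
    """Assign each point to the uniform grid cell that contains it (hypothetical helper)."""
    index: Dict[Cell, List[Point]] = defaultdict(list)
    for x, y in points:
        index[(int(x // cell_size), int(y // cell_size))].append((x, y))
    return index


def range_query(index: Dict[Cell, List[Point]], cell_size: float, rect: Rect) -> List[Point]:
    """Return the indexed points inside rect, visiting only cells that overlap it."""
    x_min, y_min, x_max, y_max = rect
    results: List[Point] = []
    for cx in range(int(x_min // cell_size), int(x_max // cell_size) + 1):
        for cy in range(int(y_min // cell_size), int(y_max // cell_size) + 1):
            # Cells on the border of the query may hold points outside it,
            # so each candidate point is still checked against the rectangle.
            for x, y in index.get((cx, cy), []):
                if x_min <= x <= x_max and y_min <= y <= y_max:
                    results.append((x, y))
    return results
```

In a MapReduce setting, each grid cell would roughly correspond to a partition processed by one task, so a range query would only need to read the partitions that overlap the query rectangle.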
 

ST-Hadoop is a MapReduce framework that acknowledges the fact that space and time play a crucial role in query processing. ST-Hadoop is an open-source extension of the Hadoop framework that injects spatio-temporal awareness into the code base of four layers inside SpatialHadoop, namely, the language, indexing, MapReduce, and operations layers. The spatio-temporal indexing techniques inside ST-Hadoop are primarily tuned to accommodate new and updated data sets efficiently, without the need to rebuild the index. The key point behind the performance gain of ST-Hadoop is its indexing, where data are temporally loaded and divided across computation nodes. For more information, please visit: "http://st-hadoop.cs.umn.edu/"
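
One way to picture an index that absorbs new data without being rebuilt is to partition records by time slice first and by grid cell second. The Python sketch below illustrates that general idea only. It is not ST-Hadoop's implementation; the class name `SpatioTemporalIndex`, the `slice_seconds` and `cell_size` parameters, and the `(x, y, timestamp)` record layout are assumptions made for this example.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Record = Tuple[float, float, int]         # (x, y, timestamp in seconds)
Rect = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)
Cell = Tuple[int, int]


class SpatioTemporalIndex:
    """Hypothetical two-level index: time slice first, then uniform grid cell."""

    def __init__(self, slice_seconds: int, cell_size: float) -> None:
        self.slice_seconds = slice_seconds
        self.cell_size = cell_size
        # time slice id -> grid cell -> records
        self.slices: Dict[int, Dict[Cell, List[Record]]] = defaultdict(
            lambda: defaultdict(list)
        )

    def insert(self, record: Record) -> None:
        """Add a record; only the slice it belongs to is touched."""
        x, y, t = record
        cell = (int(x // self.cell_size), int(y // self.cell_size))
        self.slices[t // self.slice_seconds][cell].append(record)

    def query(self, rect: Rect, t_start: int, t_end: int) -> List[Record]:
        """Return records inside rect with t_start <= timestamp <= t_end."""
        x_min, y_min, x_max, y_max = rect
        results: List[Record] = []
        first, last = t_start // self.slice_seconds, t_end // self.slice_seconds
        for slice_id in range(first, last + 1):
            grid = self.slices.get(slice_id)
            if grid is None:
                continue  # no data was loaded for this time slice
            for cx in range(int(x_min // self.cell_size), int(x_max // self.cell_size) + 1):
                for cy in range(int(y_min // self.cell_size), int(y_max // self.cell_size) + 1):
                    for x, y, t in grid.get((cx, cy), []):
                        if x_min <= x <= x_max and y_min <= y <= y_max and t_start <= t <= t_end:
                            results.append((x, y, t))
        return results
```

In this sketch, records that arrive with later timestamps fall into new time slices, so the slices that were already built stay untouched, and a query skips every slice outside its time window.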
 
   Keynotes
  1. Keynote: Mohamed F. Mokbel "Thinking Spatial". The Asia Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data, APWeb-WAIM, Beijing, China, July 2017.

  2. Plenary Keynote: Mohamed F. Mokbel "The Era of Big Spatial Data". A Plenary Keynote for three parallel international conferences: IEEE Conference on Big Data and Cloud Computing (BDCloud), IEEE Conference on Social Computing and Networking (SocialCom), and IEEE Conference on Sustainable Computing (SustainCom), Atlanta, GA, October 2016.

  3. VLDB 10-Years Best Paper Award Talk: Mohamed F. Mokbel "Location Data Management: A Tale of Two Systems and the Next Destination!". In the International Conference on Very Large Databases, VLDB 2016, New Delhi, India, September 2016.

  4. Keynote: Mohamed F. Mokbel "The Era of Big Spatial Data". In the 27th Australian Database Conference (ADC), Sydney, Australia, September 2016.

  5. Keynote: Mohamed F. Mokbel "Towards a Microblogs Data Management System". In the 6th International Workshop with Mentors on Databases, Web and Information Management for Young Researchers, iDB, Nara, Japan, August 2015.

   Tutorials
  1. 90-minute Tutorial: Ahmed Eldawy and Mohamed F. Mokbel "The Era of Big Spatial Data". In Proceedings of the International Conference on Very Large Databases, VLDB 2017, Munich, Germany, August 2017.

  2. 90-minute Tutorial: Amr Magdy and Mohamed F. Mokbel "Microblogs Data Management and Analysis". In ACM International Conference on Management of Data, SIGMOD 2016, San Francisco, CA, June, 2016.

  3. 90-minute Tutorial: Ahmed Eldawy and Mohamed F. Mokbel "The Era of Big Spatial Data". In the IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May, 2016.

  4. 90-minute Tutorial: Amr Magdy and Mohamed F. Mokbel "Microblogs Data Management and Analysis". In the IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May, 2016.

  5. 2-hour Tutorial: Ahmed Eldawy and Mohamed F. Mokbel "The Era of Big Spatial Data". In the IEEE International Conference on Big Data, BigData 2015, Santa Clara, CA, Oct., 2015.

   Invited Talks
  1. "Thinking Spatial". Microsoft Research Asia, Beijing, China, July, 2017.

  2. "The Era of Big Spatial Data". China Agricultural University, Beijing, China, July, 2017.

  3. "Thinking Spatial". Huawei, Santa Clara, CA, Dec 2016.

  4. "Thinking Spatial". Uber, Palo Alto, CA, Dec 2016.

  5. "The Era of Big Spatial Data". Qatar Computing Research Institute, QCRI, Doha, Qatar, Oct 2016.

  6. "Towards a Microblogs Data Management System". University of New South Wales, UNSW, Sydney, Australia, Sep 2016.

  7. "The Era of Big Spatial Data". University of Melbourne, Melbourne, Australia, Sep 2016.

  8. "The Era of Big Spatial Data". University of Iowa, Iowa City, IA, USA, May, 2016.

  9. "The Era of Big Spatial Data". King Fahd University of Petroleum and Minerals, KSA, Mar., 2016.

  10. "The Era of Big Spatial Data". New York University - Abu Dhabi, Abu Dhabi, UAE, Feb., 2016.

  11. "The Era of Big Spatial Data". Emory University, Atlanta, GA, USA, Oct., 2015.

   Journal Publications:
  1. Abdeltawab Hendawi, Mohamed Ali, Mohamed F. Mokbel. "Panda*: A Generic and Scalable Framework for Predictive Spatio-temporal Queries". GeoInformatica. 21(2): 175-208, 2017.

  2. Xiaochuang Yao, Mohamed F. Mokbel, Louai Alarabi, Ahmed Eldawy, Jianyu Yang, Wenju Yun, Lin Li, Sijing Ye, and Dehai Zhu. "Spatial Coding-based Approach for Partitioning Big Spatial Data in Hadoop". Elsevier Computers & Geosciences. 106, 60-67, 2017.

  3. Ahmed Eldawy, Mohamed F. Mokbel. "The Era of Big Spatial Data: A Survey". Foundations and Trends in Databases. 6(3-4): 163-273, 2016.

  4. Amr Magdy, Mohamed F. Mokbel, Sameh Elnikety, Suman Nath, and Yuxiong He. "Venus: Scalable Real-time Spatial Queries on Microblogs with Adaptive Load Shedding". IEEE Transactions on Knowledge and Data Engineering, TKDE 2016, 28(2): pp. 356-370, 2016.

   Conference Publications:
  1. Mohamed Sarwat, Raha Moraffah, Mohamed F. Mokbel and James Avery. "Database System Support for Personalized Recommendation Applications". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, Apr, 2017.

  2. Constantinos Costa, Georgios Chatzimilioudis, Demetrios Zeinalipour-Yazti, Mohamed F. Mokbel. "Efficient Exploration of Telco Big Data with Compression and Decaying". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, Apr, 2017.

  3. Ibrahim Sabek and Mohamed Mokbel. "On Spatial Joins in MapReduce". In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL 2017, Redondo Beach, CA, November 2017.

  4. Louai Alarabi, Mohamed F. Mokbel, and Mashaal Musleh. "ST-Hadoop: A MapReduce Framework for Spatio-temporal Data". In Proceedings of the International Symposium on Spatial and Temporal Databases, SSTD 2017, Washington, D.C., Aug., 2017.

  5. Ahmed Eldawy, Ibrahim Sabek, Mostafa Elganainy, Ammar Bakeer, Ahmed Abdelmotaleb, and Mohamed Mokbel. "Sphinx: Empowering Impala for Efficient Execution of SQL Queries on Big Spatial Data". In Proceedings of the International Symposium on Spatial and Temporal Databases, SSTD 2017, Washington, D.C., Aug., 2017.

  6. Christopher Jonathan and Mohamed Mokbel. "Towards a Unified Spatial Crowdsourcing Platform (Vision Paper)". In Proceedings of the International Symposium on Spatial and Temporal Databases, SSTD 2017, Washington, D.C., Aug., 2017.

  7. Ahmed Eldawy, Mohamed F. Mokbel and Christopher Jonathan. "HadoopViz: A MapReduce Framework for Extensible Visualization of Big Spatial Data". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May, 2016.

  8. Amr Magdy, Rami Alghamdi, and Mohamed F. Mokbel. "On Main-memory Flushing in Microblogs Data Management Systems". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May, 2016.

  9. Christopher Jonathan, Amr Magdy, Mohamed F. Mokbel and Albert Jonathan. "GARNET: A Holistic System Approach for Trending Queries in Microblogs". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May, 2016.

  10. Amr Magdy, Ahmed Aly, Mohamed Mokbel, Sameh Elnikety, Yuxiong He, Suman Nath and Walid Aref. "GeoTrend: Spatial Trending Queries on Real-time Microblogs". In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL 2016, San Francisco, CA, November 2016.

  11. Ahmed Eldawy, Louai Alarabi and Mohamed F. Mokbel. "Spatial Partitioning Techniques in SpatialHadoop". In Proceedings of the International Conference on Very Large Databases, VLDB 2015, Kohala Coast, HI, Aug, 2015.

  12. Ahmed Eldawy, Mostafa Elganainy, Ammar Bakeer, Ahmed Abdelmotaleb and Mohamed F. Mokbel. "Sphinx: Distributed Execution of Interactive SQL Queries on Big Spatial Data (Poster Paper)". In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL GIS 2015, Seattle, WA, November 2015.

  13. Reem Y. Ali, Venkata M.V. Gunturi, Shashi Shekhar, Ahmed Eldawy, Mohamed F. Mokbel, Andrew J. Kotz and William F. Northrop. "Future Connected Vehicles: Challenges and Opportunities for Spatio-temporal Computing (Vision paper)". In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL GIS 2015, Seattle, WA, November 2015. (Runner-Up for Best Vision Paper Award).


   System Demos:
  1. Christopher Jonathan and Mohamed F. Mokbel. "A Demonstration of Stella: A Crowdsourcing-Based Geotagging Framework". In Proceedings of the International Conference on Very Large Databases, VLDB 2017, Munich, Germany, Aug., 2017.

  2. Louai Alarabi and Mohamed F. Mokbel. "A Demonstration of ST-Hadoop: A MapReduce Framework for Big Spatio-temporal Data". In Proceedings of the International Conference on Very Large Databases, VLDB 2017, Munich, Germany, Aug., 2017.

  3. Harshada Chavan and Mohamed F. Mokbel. "Scout: A GPU-Aware System for Interactive Spatio-temporal Data Visualization". In Proceedings of ACM SIGMOD Conference on Management of Data, ACM SIGMOD 2017, Chicago, IL, May, 2017.

  4. Amr Magdy and Mohamed F. Mokbel. "Demonstration of Kite: A Scalable System for Microblogs Data Management". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, Apr, 2017.

  5. Constantinos Costa, Georgios Chatzimilioudis, Demetrios Zeinalipour-Yazti and Mohamed Mokbel. "SPATE: Compacting and Exploring Telco Big Data". In Proceedings of the IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, Apr, 2017.

  6. Louai Alarabi, Mohamed Mokbel, Bin Cao, Liwei Zhao, Anas Basalamah. "A Demonstration of SHAREK: An Efficient Matching Framework for Ride Sharing Systems". In Proceedings of the ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL 2016, San Francisco, CA, November 2016.

  7. Ahmed Eldawy, Mohamed F. Mokbel, and Christopher Jonathan. "A Demonstration of HadoopViz: An Extensible MapReduce System for Visualizing Big Spatial Data". In Proceedings of the International Conference on Very Large Databases, VLDB 2015, Kohala Coast, Hawaii, August, 2015.


   Workshop Papers:
  1. Harshada Chavan, Rami Alghamdi, Mohamed F. Mokbel. "Towards a GPU Accelerated Spatial Computing Framework". In Proceedings of the IEEE ICDE Workshop on Big Data Management on Emerging Hardware, HardBD 2016, co-located with ICDE 2016, Helsinki, Finland, May, 2016.