GRADES-NDA
Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
Co-located with SIGMOD 2020 (14 June 2020, Portland, OR, USA)

The video recording of the workshop is now available here.


The focus of the GRADES-NDA workshop is the application areas, usage scenarios and open challenges in managing large-scale graph-shaped data. The workshop is a forum for exchanging ideas and methods for mining, querying and learning with real-world network data, developing new common understandings of the problems at hand, sharing of data sets and benchmarks where applicable, and leveraging existing knowledge from different disciplines. Additionally, considering specific techniques (e.g., algorithms, data/index structures) in the context of the systems that implement them, GRADES-NDA aims to present technical contributions inside graph, RDF, and other data management systems on graphs of a large size.

Keynote Speakers

We are honored to have the following keynote speakers from different communities (Data Mining/Semantic Web/Databases) to talk about their exciting research in the broad field of graph data management.

Program

All times are in PDT (Pacific Daylight Time, UTC/GMT -7 hours).

Opening remarks bright and early at 8AM

8:10AM-9:00AM Session 1:   Graph learning and benchmarking

  • Graph Learning with Loss-Guided Training Eliav Buchnik and Edith Cohen
  • The Graph Based Benchmark Suite Laxman Dhulipala, Jessica Shi, Tom Tseng, Guy Blelloch and Julian Shun
  • Supporting Dynamic Graphs and Temporal Entity Deletions in the LDBC Social Network Benchmark's Data Generator Jack Waudby, Benjamin A. Steer, Arnau Prat-Pérez and Gábor Szárnyas

Coffee break

9:15AM-10:15AM Keynote: The Web of Data is partially unavailable, so what? - Katja Hose Abstract: More and more knowledge graphs are becoming available on the Web, many of them accessible via queryable interfaces, such as SPARQL endpoints. These knowledge graphs are often linked to each other and in doing so form the so-called Web of Data. The great strength and potential of exploiting these links is that we can obtain answers for queries that would not yield any results over the knowledge graphs if considered individually. However, one of the fundamental problems is that Web-accessible knowledge graphs are not always available when we would like to access them. This means that the exact same query issued at different points in time might have different results or, in the worst case, no results if one of the knowledge graphs that are essential to answer the query is (temporarily) not available. Often this is caused by crashes of the server hosting the knowledge graph – a problem that is deeply rooted within the architecture of the current Web of Data, i.e., relying on data providers to keep interfaces and knowledge graphs available and accessible. Building upon the basics of federated query processing over knowledge graphs, this talk will discuss challenges as well as strategies on how to overcome the unavailability problem and keep the data available and queryable. Speaker bio: Katja Hose is a Professor in the Department of Computer Science at Aalborg University. Prior to joining Aalborg University, she was a postdoc at the Max Planck Institute for Informatics in Saarbrücken, Germany, and earned her PhD in Computer Science from Ilmenau University of Technology, Germany. Her research is rooted in databases and Semantic Web technologies and spans theory, algorithms, and applications of Data and Web Science incl. knowledge querying, analyzing, publishing, and extracting. She has co-authored more than 90 peer-reviewed publications (including a few best paper and best paper runner-up awards). Katja Hose regularly serves as a reviewer for databases and Semantic Web conferences and journals (e.g., VLDB, SIGMOD, TheWebConf/WWW, ISWC,TKDE, DAPD). She has served in many different roles as (co-)chair (workshop, senior PC, tutorials, posters, demos, publicity, proceedings) for a broad range of international conferences (e.g., VLDB, SIGMOD, ICDE, TheWebConf/WWW, ISWC).

Tea and cookies break

10:30AM-11:25AM Session 2:   Graph analysis and pattern matching

  • ELite: Cost-effective Approximation of Exploration-based Graph Analysis Xiaofei Zhang, Tamer Ozsu and Lei Chen
  • EdgeFrame: Worst-Case Optimal Joins for Graph-Pattern Matching in Spark Per Fuchs, Peter Boncz and Bogdan Ghit
  • Towards Interactive Pattern Search in Massive Graphs Tahsin Reza, Matei Ripeanu, Geoffrey Sanders and Roger Pearce

Snack break

11:40AM-12:40PM Keynote: Graph Neural Networks for Graph Search - Yizhou Sun Abstract: Graph neural networks (GNNs) have received more and more attention in past several years, due to the wide applications of graphs and networks, and the superiority of their performance compared to traditional heuristics-driven approaches. However, most existing GNNs still focus on node-level applications, such as node classification and link prediction, and many challenging graph tasks are graph-level, such as graph search. In this talk, I will introduce our recent progress on graph-level neural operator development. In particular, we will examine three challenging tasks that are key to the success of graph search: (1) How can we conduct efficient graph similarity search by turning the NP-Complete problems, such as the Graph Edit Distance (GED) and Maximum Common Sub-graph (MCS) computation, into a learning problem? We will present SimGNN and GraphSim that are able to provide more efficient and effective results compared to state-of-the-art approximate algorithms. (2) How can we provide a neural operator that can turn any graph into a low dimensional representation vector, which is learnable, inductive, and unsupervised? In this line, we propose UGraphEmd that is able to leverage graph-graph interaction to produce manifold-preserving graph-level embedding. Moreover, GHashing is designed to map each graph to a discrete hash code, which enables a much more efficient search (20 times speed up) to handle large graph database with millions of graphs. And (3) how can we design GNNs that can directly detect the best matched sub- graphs of two graphs? A deep reinforcement learning framework RLMCS is then proposed to address this issue, with the goal to learn the best strategy to pick the next matching pair for two graphs. In the end, we will provide some discussions to the open questions in the field. Speaker bio: Yizhou Sun is an associate professor at department of computerscience of UCLA. Prior to that, she was an assistant professor in the College of Computer and Information Science of Northeast- ern University. She received her Ph.D. in Computer Science from the University of Illinois at Urbana-Champaign in 2012. Her principal research interest is on mining graphs/networks, and more generally in data mining, machine learning, and network science, with a focus on modeling novel problems and proposing scalable algorithms for large-scale, real-world applications. She is a pioneer researcher in mining heterogeneous information network, with a recent focus on deep learning on graphs/networks. Yizhou has over 100 publications in books, journals, and major conferences. Tutorials of her research have been given in many premier conferences. She received 2012 ACM SIGKDD Best Student Paper Award, 2013 ACM SIGKDD Doctoral Dissertation Award, 2013 Yahoo ACE (Academic Career Enhancement) Award, 2015 NSF CAREER Award, 2016 CS@ILLINOIS Distinguished Educator Award, 2018 Amazon Research Award, and 2019 Okawa Foundation Research Grant.

Lunch

1:35PM-2:30PM Session 3:   Potpourri

  • Smooth Kronecker: Solving the Combing Problem in Kronecker Graphs Vaastav Anand, Puneet Mehrotra, Daniel Margo and Margo Seltzer
  • Context-Free Path Querying with Single-Path Semantics by Matrix Multiplication Rustam Azimov, Semyon Grigorev, Artem Khoroshev and Arseniy Terekhov
  • A Framework for DSL-Based Query Classification Using Relational and Graph-Based Data Models Peter K. Schwab, Maximilian Langohr and Klaus Meyer-Wegener

Sugar and caffeine break

2:45PM-3:45PM Industrial talks

  • Working with property graphs using SAP HANA SAP
  • Graph Data Science with Neo4j Neo Technology
  • IBM Db2 Graph: Synergistic and Retrofittable Graph Queries inside IBM Db2 IBM
  • Modern Aggregation Support in TigerGraph Alin Deutsch, TigerGraph
  • Graph Applications and Systems in Alibaba Alibaba

Social break

4:00PM-5:00PM Free-roam posters and demos

Demos:

  • Addressing graph worflows using SAP HANA SAP
  • Hands on: using Neo4j's Graph Data Science Library Neo Technology
  • Supporting Synergistic and Retrofittable Graph Queries using IBM Db2 Graph IBM
  • Why You Should Train Large Graph Convolutional Network (GCN) In A Graph Database - a GCN demo in TigerGraph Changran Liu, TigerGraph
  • The Graph Based Benchmark Suite Laxman Dhulipala, Jessica Shi, Tom Tseng, Guy Blelloch and Julian Shun
  • A Framework for DSL-Based Query Classification Using Relational and Graph-Based Data Models Peter K. Schwab, Maximilian Langohr and Klaus Meyer-Wegener

Posters:

  • Graph Learning with Loss-Guided Training Eliav Buchnik and Edith Cohen
  • Supporting Dynamic Graphs and Temporal Entity Deletions in the LDBC Social Network Benchmark's Data Generator Jack Waudby, Benjamin A. Steer, Arnau Prat-Pérez and Gábor Szárnyas
  • ELite: Cost-effective Approximation of Exploration-based Graph Analysis Xiaofei Zhang, Tamer Ozsu and Lei Chen
  • EdgeFrame: Worst-Case Optimal Joins for Graph-Pattern Matching in Spark Per Fuchs, Peter Boncz and Bogdan Ghit
  • Towards Interactive Pattern Search in Massive Graphs Tahsin Reza, Matei Ripeanu, Geoffrey Sanders and Roger Pearce
  • Smooth Kronecker: Solving the Combing Problem in Kronecker Graphs Vaastav Anand, Puneet Mehrotra, Daniel Margo and Margo Seltzer
  • Context-Free Path Querying with Single-Path Semantics by Matrix Multiplication Rustam Azimov, Semyon Grigorev, Artem Khoroshev and Arseniy Terekhov

Closing remarks and virtual drinks

Call for Papers

The goal of GRADES-NDA is to bring together researchers from academia, industry, and government, (1) to create a forum for discussing recent advances in (large-scale) graph data management and analytics systems, as well as propose and discuss novel methods and techniques towards (2) addressing domain specific challenges or (3) handling noise in real-world graphs.

The workshop will be of interest to researchers in the development of novel data-management applications and systems for large-scale graph analytics. More specifically, the intended audience are, but not limited to, academic and industrial computer scientists interested in databases and data mining, machine learning, data streaming, graph theory and algorithms. Along with novel research work, we encourage submissions with demonstrations and case studies from real-life experiences in various domains such as Social Networks, Biological Network Data, Marketing and Media, Business Data Analysis, Healthcare Data, Cybersecurity etc.

Topics of interest include but are not limited to the following.

  • Graph query languages, visualization techniques and querying interfaces, and their effective realization
  • Graph platform and parallel platforms, e.g., Flink/Gelly, Titan, SPARK/GraphX, GraphLab/PowerGraph, Giraph, GraphChi etc.
  • Network data representation, storage, indexing and querying methods.
  • Experiences or techniques for graph specific operations such as traversals or inference/reasoning in the context of large data sets and on the systems that implement those operations.
  • RDF data management and analytics
  • Dynamic Graphs: managing graph updates; graph stream analytics; analyzing evolution and detection of community structures in real-world evolving graphs
  • Mining and machine learning on heterogeneous networks -- knowledge graphs etc.
  • Graph summarization and sampling
  • Game Theory, Social contagion and Information propagation on networks
  • Analytics on dirty, noisy, or uncertain graphs
  • Spatial and temporal graph analytics
  • Analytics on social, biological, retail, marketing, customer care, financial, healthcare, transportation network data sets
  • Descriptions of graph data management use cases and query workloads, and experiences with applying data management technologies in such situations
  • Vision and systems papers describing potential or real applications and benefits of graph management

Accepted papers will be published by ACM, indexed by DBLP, and would be available in the ACM DL.

Important Dates

  • Paper Submission: March 23 April 6, 2020
  • Notifications: April 13 April 27, 2020
  • Camera Ready Submission: April 27 May 11, 2020
  • Workshop Date: June 14, 2020

All deadlines are 23:59 Hours AoE

We are aware that many people are trying to adapt to a new and challenging life circumstances due to the global spread of COVID-19. To accommodate for this, we decided to extend the GRADES-NDA submission and review deadlines by 2 weeks.

Workshop Organizers


Steering Committee

Paper Submission

Authors are invited to submit original, unpublished research papers (full and short), demonstrations and case-studies.

Submissions must follow the latest 2-column ACM Master article "sigconf" proceedings LaTeX template with 10pt font size, and should be double-blind. Details on the Anonymity requirements for submitted manuscripts are present at the SIGMOD 2020 Call for Papers page.

Length Requirements:

  • Full papers should be a maximum of 8 pages in length, excluding references and appendix.
  • Case studies should be a maximum of 4 pages in length, excluding references and appendix.
  • Short papers and demonstration papers should be a maximum of 4 pages in length, excluding references and appendix.

Submissions will be handled through Easychair. To submit click here.

Program Committee

  • Renzo Angles, Universidad de Talca
  • Marcelo Arenas, PUC Chile
  • Alex Averbuch, Neo4j
  • Amitabha Bagchi, Indian Institute of Technology, Delhi
  • Srikanta Bedathur, Indian Institute of Technology, Delhi
  • Peter Boncz, CWI
  • Yang Cao, The University of Edinburgh
  • Edith Cohen, Google Research
  • Mahashweta Das, Visa Research
  • Sourav Dutta, Bell Labs, Ireland
  • Irini Fundulaki, ICS-Forth
  • Sainyam Galhotra, UMass Amherst
  • Alberto Garcia-Duran, EPFL
  • Kiran Garimella, MIT
  • Ricard Gavaldà , UPC Barcelona
  • Joan Guisado-Gámez, UPC Barcelona
  • Jiawei Han, University of Illinois Urbana-Champaign
  • Russ Harmer, CNRS & ENS Lyon
  • Jan Hidders, VU Brussels
  • Vasiliki Kalavri, Boston University
  • Panos Kalnis, King Abdullah University of Science and Technology
  • Josep Lluís Larriba-Pey, UPC Barcelona
  • Essam Mansour, Concordia University
  • Marcus Paradies, DLR
  • Arnau Prat-Pérez, Sparsity
  • Maya Ramanath, Indian Institute of Technology, Delhi
  • Sayan Ranu, Indian Institute of Technology, Delhi
  • Erdem Sariyuce, University at Buffalo
  • Juan Sequeda, Capsenta Labs
  • Julian Shun, MIT
  • Yogesh Simmhan, Indian Institute of Science, Bangalore
  • Andreas Spitz, Ecole Polytechnique Fédérale de Lausanne
  • Gábor Szárnyas, Budapest University of Technology and Economics
  • Martin Theobald, University of Luxembourg
  • Enara Vijil, IBM
  • Hannes Voigt, Neo4j
  • Tianxing Wu, Nanyang Technological University
  • Yinghui Wu, Washington State University
  • Yinglong Xia, Facebook Research
  • Oskar van Rest, Oracle

Past Workshops

GRADES-NDA is in its third edition, and had successful joint meetings collocated with SIGMOD/PODS 2018 and 2019 respectively. Specifically, it is the merger of the GRADES and NDA workshops, which were each independently organized and successfully held at previous SIGMOD-PODS meetings, GRADES since 2013 and NDA since 2016. The organizers of GRADES and NDA mutually agreed upon to aim for a joint meeting from 2018 onwards.

Sponsored by

Neo4j TigerGraph Alibaba
IBM SAP GOOGLE

Get in touch

For questions, please email. Follow us on Twitter for latest updates about the workshop.