Posts by Collection

portfolio

publications

Scalable Subgraph Enumeration in MapReduce

Published in PVLDB 8(10), 2015

This paper studies the subgraph enumeration problem in MapReduce. It uses a special structure called TwinTwig in a left-deep join and proves this approach to be instance-optimal compared with the traditional star-based join. The source code can be forked from my GitHub page.

Download here

Distributed Subgraph Matching on Timely Dataflow [Experiments and Analyses]

Published in VLDB 2019 (to appear), 2019

This paper systematically compares existing distributed subgraph matching algorithms. It groups them into four strategies: BinJoin (binary-join-based subgraph-growing algorithms), WOptJoin (worst-case optimal vertex-growing algorithms), ShrCube (sharing of hypercubes) and FulRep (full replica of the graph data on each machine). We are currently working on open-sourcing the code. Please stay tuned!

Download here

talks

teaching

CS9311/CS3311 Database Systems Tutoring

Postgraduate course, CSE, UNSW, 2015

This course covers the principles of database systems, including the ER and relational models, relational algebra, indexing techniques and join optimisations. I tutored the lab component of the course; my duties included designing the programming exercises, supervising the lab and answering questions.

DATA 1001 Lecturing

Undergraduate course, CSE, UNSW, 2017

This is a joint course offered by Computer Science, Mathematics and the Business School at UNSW. I am in charge of the computer science lectures, which cover an overview of data science, the ER and relational models, SQL, Big Data, MapReduce, and research topics on graph, spatial and uncertain data.