Published in AINA 2013, 2013
This paper studied a shared-memory approach to improving the performance of Hadoop Streaming, an extension of Hadoop that supports programming languages other than Java. You can fork the code from my GitHub page.
Download here
Published in PVLDB 8(10), 2015
This paper studied the subgraph enumeration problem using MapReduce. It applies a special structure called TwinTwig in a left-deep join and proves the resulting join to be instance-optimal compared with the traditional star-based join. The source code can be forked from my GitHub page.
Download here
Published in The VLDB Journal 26(3), 2017
This paper extended the VLDB15 paper to include a data-graph compression technique based on structural symmetry.
Download here
Published in UNSW-CSE-Tech, 2017
This is the technical report of my VLDB17 subgraph enumeration paper, in which you can find full proofs and complete algorithms for clique compression.
Download here
Published in PVLDB 10(3), 2017
This paper explored an optimal bushy join algorithm for subgraph enumeration in the distributed context. Please fork and play with the source code from my GitHub page.
Download here
Published in VLDB 2019 (to appear), 2019
This paper systematically compares existing distributed subgraph matching algorithms. It groups the literature into four strategies: BinJoin (binary-join-based subgraph-growing algorithms), WOptJoin (worst-case optimal vertex-growing algorithms), ShrCube (sharing of hypercubes) and FulRep (full replication of the graph data on each machine). We are currently working on open-sourcing the code. Please stay tuned!
Download here
Postgraduate course, CSE, UNSW, 2015
This course covers the principles of database systems, including the ER and relational models, relational algebra, indexing techniques and join optimisations. I tutored the lab for the course; my duties included designing the programming exercises, supervising the lab and running Q&A.
Undergraduate course, CSE, UNSW, 2017
This is a joint course of Computer Science, Mathematics and the Business School at UNSW. I am in charge of the lectures for the computer science part. The lectures include an overview of data science, the ER and relational models, SQL, Big Data, MapReduce, and research topics on graph, spatial and uncertain data.