FlowCV Logo
About Me

I am a passionate PhD researcher with expertise in graph databases, query optimization, and data quality. I will graduate in January 2024. I have a strong track record of published research and academic excellence, with almost two years of experience in the industry. I seek to apply advanced database skills and innovative problem-solving abilities to industry projects. My dedication lies in implementing data-driven solutions that can significantly impact a constantly evolving corporate environment.

Experience
PhD Researcher, Dept. of Computer Science, Aalborg University

Databases, Graphs, Query optimization, Schemas, and Data Quality

09/2020 – presentAalborg, Denmark
Scientific Research Assistant, Dept. of Computer Science, Aalborg University

Managing projects, maintaining and improving existing features, conducting surveys, and writing research reports.

10/2019 – 08/2020Aalborg, Denmark
Data Engineer Intern, everis NTT Data Company

Data Modeling, Designing Scalable Architecture, working on end-to-end solution for EU-CORDIS research projects. (Part of Semantic Business Unit)

06/2018 – 10/2018Barcelona, Spain
Software Engineer, eConceptions pvt Ltd

Backend Development (Java), Front-end development (Javascript, MEAN Stack)

01/2016 – 09/2017Islamabad, Pakistan
Web Developer, Solution Providers(eSolPro) pvt Ltd

APIs development (PHP), PSD design, HTML, CSS, Javascript

08/2015 – 09/2015Islamabad, Pakistan
Education
PhD, Dept. of Computer Science, Aalborg University

Thesis Title: Query optimization, Schema Extraction, and Data Quality for Graphs.

Supervisors: Prof. Katja Hose, Assist. Prof. Matteo Lissandrini

09/2020 – 04/2024Aalborg, Denmark
Masters in Computer Science, TU-Berlin

Master Degree in "Big Data Management and Analytics". Final Grade: 1,4 (very good)

09/2017 – 08/2019Berlin, Germany
Bachelors in Computer Science, COMSATS University

CGPA 3.84/4.00 (Campus Gold Medal & Institute Silver Medal)

02/2013 – 02/2017Islamabad, Pakistan
Area of Expertise
Databases (SQL & NoSQL)

MySQL, PostgreSQL, Oracle Database, Microsoft SQL Server, SQLite, MongoDB, Redis, Amazon DynamoDB

Programming Languages

Java, Python, C, C++, JavaScript

Academic & Soft Skills

Research, Reviewing, Paper Writing, Teaching, Supervision, Censoring, Dissemination, Networking, Collaboration

Web Development

Java Spring Framework, Vaadin Framework, MVC, HTML, CSS, Bootstrap, Ajax

Data Visualization

Tableau, Power BI, Matplotlib, Highcharts, Plotly

Cloud Platforms

Amazon Web Services (AWS), Microsoft Azure

Big Data Processing & Analytics

ETL, Spark, Apache Hadoop, Apache Flink, NLP, Weka, RapidMiner

Graphs & Semantic Technologies

Neo4j, GraphDB, Amazon Neptune, ArangoDB, OrientDB, Stardog, Virtuoso, Protege, TopBraid Suite, PoolParty, Apache Jena, RDF4j, OWL, RDF/S, SKOS, SPARQL, SHACL

Misc.

BPMN (Business Process Model and Notation), Latex, Docker

Achievements
PhD Study Abroad Travel Grants 2023, Otto Mønsted & Danish Data Science Academy

I was awarded travel grants by Otto Mønsted and Danish Data Science Academy for study abroad during my PhD studies.

2023
Big Data Talent Awards 2019 (Runner-up), FIB, UPC Barcelona, Spain

I was selected as a runner-up for Big Data Talent Awards 2019 at UPC Barcelona, Spain, for my master thesis.

2019
Erasmus Mundus Scholarship, Erasmus+ EU programme for education, training, youth and sport

I was awarded a fully funded scholarship for my master's degree in Big Data Management and Analytics.

2017
Gold & Silver Medal, COMSATS University, Islamabad Pakistan

I was awarded with a campus Gold Medal award and an institute Silver Medal award for the highest CGPA: 3.84/4.00 (Batch Spring 2013-17) of my Bachelor's degree.

2017
Professional Activities
Supervisor, Groups Supervisor for BSc Software Engineering students
  • 4th Semester Students: Supervision of groups working in medical domain for knowledge engineering. Specifically, with MIMIC healthcare dataset, aiming to design a common general purpose ontology using existing common data models like OMOP, SCDM, and PCORnet.
  • 6th Semester Students: Supervision of groups working with large scale ships AIS data aiming to create a system to improve existing navigation and illegal fishing
  • 2021 – 2023Aalborg University, DK
    Teaching Assistant, Database Management System

    Responsibilities: Database design and querying exercise sessions

    2021 – 2022Aalborg University, DK
    External Reviewer, VLDB-2022, ISWC-2022, SIGMOD-2022
    Languages
    English — C1, Danish — PD-3, Urdu — Mother Tongue
    Publications
    1. Extraction of Validating Shapes from very large Knowledge Graphs, VLDB

    Kashif Rabbani, Matteo Lissandrini, and Katja Hose, In Proceedings of the Very Large Databases 2023 (Volume 16 Issue 5, VLDB-2023), Vancouver Canada.

    2023
    2. SHACTOR: Improving the Quality of Large-Scale Knowledge Graphs with Validating Shapes, SIGMOD/ACM

    Kashif Rabbani, Matteo Lissandrini, and Katja Hose, In Proceedings of the 2023 International Conference on Management of Data, (SIGMOD-Companion '23) Seattle, USA.

    2023
    3. Incremental Schema Integration for Data Wrangling via Knowledge Graphs, SWJ

    Javier Flores, Kashif Rabbani, Sergi Nadal, Cristina Gómez, Oscar Romero, Emmanuel Jamin, and Stamatia Dasiopoulou, In Semantic Web Journal.

    2023
    4. SHACL and ShEx in the Wild: A Community Survey on Validating Shapes Generation and Adoption, TheWebConf/ ACM

    Kashif Rabbani, Matteo Lissandrini, and Katja Hose, In Companion Proceedings of the Web Conference WWW, 2022

    2022
    5. Optimizing SPARQL Queries using Shape Statistics, EDBT

    Kashif Rabbani, Matteo Lissandrini, and Katja Hose, In Proceedings of the 24th International Conference on Extending Database Technology.

    2021
    6. ODIN: A dataspace management system, ISWC

    Nadal Francesch Sergi, Kashif Rabbani, Óscar Romero Moral, and Shumet Tadesse Nigatu, In Proceedings of the ISWC 2019 Satellite Tracks (Posters & Demonstrations, Industry, and Outrageous Ideas) co-located with 18th International Semantic Web Conference (ISWC).

    2019
    7. ARDI: Automatic Generation of RDFS Models from Heterogeneous Data Sources, EDOC

    Shumet Tadesse, Cristina Gómez, Oscar Romero, Katja Hose, Kashif Rabbani, In IEEE 23rd International Enterprise Distributed Object Computing Conference (EDOC).

    2019