Paweł Guzewicz


I am a PhD student at École Polytechnique in area of cloud-scale databases and graph knowledge representation. I am a member of CEDAR, a joint team of Inria Saclay and LIX (CNRS – UMR 7161 and École Polytechnique). My PhD thesis is on expressive and efficient analytics for RDF graphs.

I work at Alan Turing Building (1 Rue Honoré d'Estienne d'Orves, 91120 Palaiseau), office 1027.

Here are some photos of me working, taken during CEDAR team photo session (December 2019).



  • Big data analytics
  • Cloud scale computing
  • Graph knowledge representation
  • Data warehouse architectures
  • Semantic Web


Programming Languages & Tools

  • Java
  • Python
  • Scala
  • C and C++
  • SQL
  • Spark
  • Hadoop
  • Git
  • SVN
  • Bash
  • LaTeX

Natural Languages:

  • Polish (mother tongue)
  • English (full professional proficiency)
  • French (communicative proficiency)
  • German (elementary proficiency)
  • Russian (elementary proficiency)


École Polytechnique (Institut Polytechnique de Paris)

PhD degree

Computer Science, ExpRalytics: Expressive and Efficient Analytics for RDF Graphs

October 2018 - (September 2021)

Télécom ParisTech (Université Paris-Saclay)

Master’s degree, M2

Computer Science, Data & Knowledge with mention "Très bien"

September 2017 - September 2018

École Polytechnique Fédérale de Lausanne

Erasmus+ exchange, first year of master studies

Computer Science

September 2016 - July 2017

University of Wrocław

Bachelor’s degree in Computer Science

Computer Science

October 2013 - July 2016


École Polytechnique

PhD Student

ExpRalytics: Expressive and Efficient Analytics for RDF Graphs

October 2018 - (September 2021)


Research Intern

Research in the domain of data representation with RDF graphs

March 2018 - August 2018


PHP Backend Developer

Programming backend of the mobile application for selling meals, working in the startup environment

May 2017 - August 2017

Human Dialog

C++ Programmer

Programming graphical user interface for enterprise resource planning application, working in the team

July 2015 - August 2015


Java Software Tester

Creating test cases and scenarios for software for algorithmic trading, working in agile framework: scrum

July 2012


Google Scholar profile

List of publications at HAL archive server

  1. RDF graph summarization for first-sight structure discovery
    François Goasdoué, Paweł Guzewicz, Ioana Manolescu
    The VLDB Journal <hal-02530206>
  2. Spade: A Modular Framework for Analytical Exploration of RDF Graphs
    Yanlei Diao, Paweł Guzewicz, Ioana Manolescu, Mirjana Mazuran
    VLDB - 45th International Conference on Very Large Data Bases, Los Angeles, United States, August 2019. <hal-02152844>
  3. Parallel Quotient Summarization of RDF Graphs
    Paweł Guzewicz, Ioana Manolescu
    Semantic Big Data - 4th International Workshop on Semantic Big Data in conjunction with SIGMOD conference, Amsterdam, Netherlands, July 2019. <hal-02106521>
  4. Incremental structural summarization of RDF graphs
    François Goasdoué, Paweł Guzewicz, Ioana Manolescu
    EDBT - 22nd International Conference on Extending Database Technology, Lisbon, Portugal, March 2019. <hal-01978784>
  5. Internship report: Quotient RDF graph summarization
    Paweł Guzewicz
    Palaiseau, France, September 2018. <hal-01879898>
  6. Quotient RDF Summaries Based on Type Hierarchies
    Paweł Guzewicz, Ioana Manolescu
    DESWeb - Data Engineering meets the Semantic Web in conjunction with IEEE ICDE - 34th IEEE International Conference on Data Engineering, Paris, France, April 2018. <hal-01721163v2>


  1. Best Demo Paper Award
    Award for Spade at BDA conference in Lyon

Community Service


October 2019 - now
December 2018 - April 2019

PhD Students Representation

PhD Students Representative at LIX lab of École Polytechnique
March 2019 - January 2020

Web Mastering

January - September 2019