YONGJOO PARK

Assistant Professor, Illinois CS (UIUC)
yongjoo@g.illinois.edu

Co-founder / Chief Scientist, Keebo
yongjoo@keebo.ai

[CV]    [Publication]    [CreateLab]    [orcid]

Research   I'm interested in big data systems. Specifically, I am investigating how we should build systems and frameworks that can offer more interactive, adaptive, and scalable data management in modern computing environments.

At IllinoisCS, I am a member of DAIS. You can join DAIS Slack Workspace


News (Sep 2023)   "ElasticNotebook" is accepted to PVLDB'23 (or VLDB'24). Congrats Billy!

News (June 2023)   Awarded $1.2M NSF grant with Hari for "system + causal inference"!

News (May 2023)   "AirIndex" is accepted to SIGMOD 2024. Congrats Supawit!

News (April 2023)   "Transactional Python" is accepted to DEEM @SIGMOD 2023. Congrats Supawit and Billy!

News (Feb 2023)   "DeepOLA" is accepted to SIGMOD 2023. Congrats Nikhil and Supawit!

News (Feb 2023)   Billy's work "S/C" is accepted to ICDE 2023.

News (Feb 2023)   Awarded NCSA Fellowship. Thank you so much!

News (Oct 2022)   Keebo raises $10.5M for Series A; total funding is $15M!   (news)   (interview)

  • TEACHING

    SP 2024. CS 598 YP - ML and Data Systems     [Course Info]   [Canvas]  

    FA 2023. CS 511 - Advanced Data Management     [Course Info]   [Canvas]  

    FA 2022. CS 511 - Advanced Data Management     [Course Info]   [Canvas]

    FA 2021. CS 411 - Database Systems     [Course Info]

    SP 2021. CS 511 - Advanced Data Management     [Course Info]

  • PROJECTS

    Currently, my primary research focus is to develop innovative Systems for Data Science.

    • CARE (CAusal-RElational data system) aims to develop a system for end-to-end causal inference with an emphasis on efficiency and ease of use. Causal inference is a principled approach to understanding why.
    • Kishu aims to bring durability, atomicity, replicability, and time versoning to Jupyter-like interactive data science systems. This project will address fundamental limitations that make those systems brittle.
      [Transactional Python, DEEM@SIGMOD’23]
      [ElasticNotebook, VLDB’24]
    • DeepOLA aims to offer progressive analytics (or Online Aggregation) to continuous data exploration frequently observed with Pandas-like libraries.   [SIGMOD’23]

    I am also investigating how data systems (such as key-value stores) should look like for emerging ad-hoc workloads.

    • AirDB aims to allow transactions directly on storage layer (without servers).
    • AirIndex optimizes the structures of indexes automatically, considering system performance like latency, bandwidth, etc.   [SIGMOD’24]
  • ADVISING

    I am fortunate to work with these smart and motivated students.

    1. Supawit Chockchowwat (PhD student, from 2020)
    2. Billy Li (PhD student, from 2021)
    3. Dohyun Park (PhD student, from 2023)
    4. Chuxuan Hu (PhD student, from 2023)
    5. Xinying Zheng (PhD student, from 2023, with Indy Gupta)
    6. Hanxi Fang (MS student, from 2023)
    7. Raunak Shah (MS student, from 2023)
    8. Shunning Zhang (MS student, from 2023)
    9. Arthur Huang (MS student, from 2023)
    10. Hanjun Goo (PhD student @SNU, in collaboration with Kyuseok Shim)

    You can also find undergraduate student and past members on this research group website.

    Prospective PhD Students: Every year, I recruit one or two new PhD students. If you are interested in working with me, please indicate in your application or send me an email (to my Illinois address).

    These are some helpful articles I wrote:

    1. TOEFL score requirements for CS@UIUC
    2. Information about Illinois’ PhD admission process (2020)

    UIUC Undergrads: Every semester, I plan to advise a few students (and more during summer). If you are interested, please send me an email.

    Prospective Master’s Students: In general, I prioritize supporting PhD students. However, I may support a few exceptional MS students with RA. (Almost all) MS students are supported with TA by the department.