Tomas Petricek
with Gerrit J.J. van den Burg, Alfredo Nazabal, Taha Ceritli, Ernesto Jimenez-Ruiz, Christopher K. I. Williams
University of Kent & The Alan Turing Institute
tomas@tomasp.net | @tomaspetricek
Getting data into usable form
Sure you want to be a data scientist?
Can automatic AI tools help with this?
Interactive - Let the analyst guide and correct the results
Unified - Share common structure
Accountable - Not just opaquely transform data
Tooling-rich - Integrate with notebook tools
Wrattler project
Research extension for JupyterLab
Mix languages, build interactive tools, analyse code provenance
How AI assistants work
AI assistant works with
AI assistant is defined by
Over in the machine learning world
Optimization-based AI assistants
Solve the problem of finding the best allowed expression
\(\mathit{best}_{\mathcal{D}}(H) = \operatorname*{arg\,max}_{e \in E_H} Q_H(\mathcal{D}, e)\)
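The optimization above can be read as a small search loop: enumerate the expressions \(E_H\) still allowed after the human interactions \(H\), score each with \(Q_H(\mathcal{D}, e)\), and return the top one. The sketch below is only an illustration under assumed definitions, not the Wrattler implementation. The toy task (guessing a column type), the names `PARSERS`, `GENERALITY_PENALTY`, `allowed`, `quality` and `best`, and the penalty values are all hypothetical.

```python
# Hypothetical toy setting: an expression is a column-type name, the data D is
# a list of raw strings, and a human interaction is a type the analyst rejects.
PARSERS = {"int": int, "float": float, "str": str}

# Assumed penalties so that more general types do not win automatically.
GENERALITY_PENALTY = {"int": 0.0, "float": 0.1, "str": 0.6}


def allowed(interactions):
    """E_H: the expressions not ruled out by the interactions H."""
    return [e for e in PARSERS if e not in interactions]


def parses(parser, value):
    """True if the parser accepts the raw string value."""
    try:
        parser(value)
        return True
    except ValueError:
        return False


def quality(interactions, data, expr):
    """Q_H(D, e): fraction of values that parse, minus a generality penalty.

    In this toy, H only restricts E_H and does not change the score itself.
    """
    parsed = sum(parses(PARSERS[expr], v) for v in data)
    return parsed / len(data) - GENERALITY_PENALTY[expr]


def best(data, interactions):
    """best_D(H): the arg max of Q_H(D, e) over the allowed expressions."""
    candidates = allowed(interactions)
    if not candidates:
        raise ValueError("no allowed expressions left")
    return max(candidates, key=lambda e: quality(interactions, data, e))


column = ["1", "2", "3.5", "n/a"]
first_guess = best(column, frozenset())         # no hints from the analyst yet
second_guess = best(column, frozenset({"float"}))  # analyst rejects "float"
```

Each correction from the analyst shrinks the set of allowed expressions, so the assistant re-solves the same optimization and may recommend a different expression.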
Capture what an AI assistant really is
Formally define the interface
Explain how tools can use AI assistants
Lets us easily capture many examples
Patches and constraints
Optimization-based datadiff assistant
Different kind of interactivity
Still fits with the formal model!
Many existing tools fit this model!
Can be extended to a Bayesian framework
Qualitative evaluation using case studies
Count necessary human interactions
Can wrangle more data with a few hints!
AI assistants: A framework for semi-automated, accountable, and tooling-rich data wrangling
Tomas Petricek, University of Kent