Big data is big
Hard-to-find special cases
The Double Anna Karenina principle
Every data set is different
Feedback loops everywhere
Can't say what works until we've done it
Death by a thousand cuts
Many tasks are repetitive
What tools do we need?
Interactive – give quick feedback
Reproducible – be able to go back
Polyglot – mix tools that work
Smart – get help from the AI
Explainable – no black boxes
1 Limited reproducibility
2 No rollback of state
3 Limited interaction model
4 One language per kernel
1 Versioning and provenance
2 Interactive development
3 Platform for AI assistants
4 Polyglot programming
New in Wrattler over the last 6 months
Integrating with standard community-driven environment
Hosting and showcasing Turing research
Simplifying data wrangling with AI assistants
Platform for Turing data science research
Platform for Turing data science research
New languages and tools for data science
Scenic places and other case studies
AI assistants for data science