sql4ml A declarative end-to-end workflow for machine learning

Reading time: 1 minute
...

📝 Original Info

  • Title: sql4ml A declarative end-to-end workflow for machine learning
  • ArXiv ID: 1907.12415
  • Date: 2019-08-05
  • Authors: Nantia Makrynioti, Ruy Ley-Wild, Vasilis Vassalos

📝 Abstract

We present sql4ml, a system for expressing supervised machine learning (ML) models in SQL and automatically training them in TensorFlow. The primary motivation for this work stems from the observation that in many data science tasks there is a back-and-forth between a relational database that stores the data and a machine learning framework. Data preprocessing and feature engineering typically happen in a database, whereas learning is usually executed in separate ML libraries. This fragmented workflow requires from the users to juggle between different programming paradigms and software systems. With sql4ml the user can express both feature engineering and ML algorithms in SQL, while the system translates this code to an appropriate representation for training inside a machine learning framework. We describe our translation method, present experimental results from applying it on three well-known ML algorithms and discuss the usability benefits from concentrating the entire workflow on the database side.

📄 Full Content

📸 Image Gallery

code_files_2.png ds_workflow_larger.png features_denormalization_larger.png features_family_city_larger.png loss_LR_boston_generated.png loss_LR_boston_manual.png loss_logistic_epsilon_generated.png loss_logistic_epsilon_manual.png proposed_workflow_larger.png sql_to_tf_ast_2_larger.png sql_to_tf_ast_larger.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut