Agile and Self-Service Data Preparation
Over the past few years we’ve witnessed a profusion of data made available, along with services to process them and turn them into insights. However, the tools available for end users haven’t fully caught up. Spreadsheets offer entry level interface to the data but are time consuming and don’t scale, while languages like R or ipython offer flexibility but have a steep learning curve for the non technical person.
Domain experts need powerful yet easy-to-use interfaces to explore new data sets, normalize them and process them via innovative services often available via an API only. OpenRefine offers the best of both worlds with a self service agile and iterative interface for data discovery and preparation and an easy-to-learn scripting language.
Martin Magdinier will present how OpenRefine can help to make your data science project more agile. Business user are now taking the lead on data process while data engineer and data scientist are here to help them to scale.
Martin Magdinier is a serial entrepreneur with a passion for data. He has been engaged with innovative start-ups and open data communities in France, Vietnam, and Canada since 2007. Coming from a business background (with a Master’s degree in IT Management), Martin Magdinier’s focus is on data management and transformation tools that empower the business user. Through his recent projects (TTCPass, 2012 Google Places API Developer Challenge
Judge’s Choice Award and Objectif Neige) and consulting positions at Alleyne Inc. Martin developed experience in bridging usage and technical teams to deliver value with a specific focus on data related issue. In 2011 Martin discovered OpenRefine and after working close with the community became one of the three community leader in 2012. In late 2014, he launched RefinePro, a cloud-based SaaS data cleaning platform based on OpenRefine.
Martin Magdinier, RefinePro