Open data science is a philosophy designed to address emerging challenges for society in data. The Open Data Science Initiative is a cross faculty project built around the ideas in this white paper.
Many talented people would like to see their ideas and work being applied for the widest benefit possible. The modern internet provides tools such as GitHub, jupyter notebook and reddit for easy distribution and comment on this material. In Sheffield we make our ideas available through these mechanisms.
The idea of open data science is to:
- Make new analysis methodologies available as widely and rapidly as possible with as few conditions on their use as possible (see the ML@SITraN group software pages and the local software page).
- Educate our commercial, scientific and medical partners in the use of these latest methodologies (see http://gpss.cc)
- Act to achieve a balance between data sharing for societal benefit and the right of an individual to own their data. (see our summary of our efforts in public understanding and debate)