The Data Mine (TDM) |
|
1301 Third Street, West Lafayette, IN 47906-4206 https://datamine.purdue.edu Contact: Mark Daniel Ward, datamine@purdue.edu |
Education and Research |
Role(s) |
• Located in the heart of Purdue University, The Data Mine gives hundreds of students from a wide variety of backgrounds and majors, both graduate and undergraduate, the opportunity to learn data science skills and apply them to research projects.
|
Mission | The Data Mine is a living, learning, and research-based community created to introduce students to data science concepts and equip them to create solutions to real-world problems. Members of The Data Mine are part of a team: they live, study, and ultimately perform data-driven research together. The Data Mine is part of Purdue University’s Office of the Provost and is designed to give students across all majors the data literacy needed to succeed in a data-driven world. |
History | • 2014-19 – Statistics Living Learning Community, funded by a $1.5 million NSF grant for 20 sophomore students per year. These 102 students produced more than 175 journal articles and conference presentations/posters on topics ranging from human development and family studies to marine biology and image processing. • 2018-19 – The Data Mine was piloted with 100 undergrad students and one Corporate Partner. • 2019-20 – The Data Mine rolled out to ~600 undergrad students with 12 Corporate Partners. • 2020-21 – The Data Mine continued to serve ~600 undergrad students and ~70 grad students with 26 Corporate Partners. • 2021-22 – The Data Mine had ~800 undergrads and ~100 grad students, with 45 Corporate Partners. First year of the Indiana Data Mine (IDM), funded by Lilly Endowment. • 2022-23 – The Data Mine had ~1000 undergrads and ~100 grad students, with 55 Corporate Partners. IDM continues to grow, and the National Data Mine Network (NDMN) is launched to support 100 students per year at Minority Serving Institutions. |
Org |
• Professional Staff of 24 • Leadership (all can be reached at datamine@purdue.edu): |
Board | The Data Mine has an External Advisory Council (https://datamine.purdue.edu/corporate/), with rotating representatives from Corporate Partners. |
Finance | • Corporate Partnerships • Grants, including Lilly Endowment and National Science Foundation • Office of the Provost, Purdue University |
Data Source | • Corporate Partners provide data for their projects. Students usually sign NDAs. • Publicly available data used for training purposes in the data science skills seminar. • Faculty provide mentoring and data for research projects. |
Data Access | Data is shared through a formal process that includes a data sharing agreement and a security review. The process is managed through Purdue’s Office of the Executive Vice President for Research and Partnerships. |
Tech Capabilities | • Secure cluster computing resources available through Purdue’s Research Computing (RCAC). • Data Scientists on staff can help plan projects. |
Projects | • Examples of Corporate Partner projects: https://datamine.purdue.edu/symposium/welcome.html • Our Examples Book, which includes the data-infused seminar material taught to all students: https://the-examples-book.com |
Future Focus | • The Data Mine is a program that embodies Purdue’s vision of offering Data Science for All. • The Indiana Data Mine is a statewide expansion, supporting regional opportunities for students in a way that also addresses the workforce talent needs in the state of Indiana. • The National Data Mine Network is a nationwide expansion, with more than 100 Minority Serving Institution partners. The NDMN enables students at MSIs to have access to Data Mine courses, research opportunities, and industry partnerships. |
Talent Development | • Talent development is our primary goal: we want to empower the data scientists of the future. • Students from all majors need strong data science skills. |
Data Sharing Agreements | Sponsor Acknowledgment: https://datamine.purdue.edu/corporate/docs/sponsoracknowledgment.docx |
Selected Publications |
• Gundlach E and Ward MD. The Data Mine: Enabling Data Science Across the Curriculum. Journal of Statistics and Data Science Education. 2021; 29(1):1-14. Available from: https://doi.org/10.1080/10691898.2020.1848484
|