Identification of Sustainability-Related Research at USC through Machine Learning and Keyword Mapping

Description

Are you passionate about data science and sustainability? Then this interdisciplinary project is for you! We will develop a machine learning program that uses pre-categorized publication samples to classify USC research publications and grants as ‘sustainability-focused’, ‘sustainability-inclusive’ or ‘not-sustainability-related’. In addition, we will use keyword lists for the 17 UN Sustainable Development Goals (SDGs) (https://sdgs.un.org/goals) to map all research groups at USC to the SDGs they relate to. Lastly, we will create an interactive dashboard in R Shiny that will act as a public directory of all research at USC, classified both by SDG and by the broader sustainability categories. As an example, see our GitHub repository for the USC curriculum: https://github.com/USC-Office-of-Sustainability/USC-SDGmap. Your work on this project is critical to boosting sustainability-related research at USC and thereby achieving our Asgmt: Earth Research Goals.
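To make the keyword-mapping step more concrete, here is a minimal sketch in Python of how a text such as a publication abstract could be matched against SDG keyword lists. The keyword lists, the SDG labels chosen, and the function name `map_text_to_sdgs` are all hypothetical placeholders for illustration; they are not the project's actual lists or code.

```python
import re
from collections import Counter

# Hypothetical, heavily shortened keyword lists (assumption for illustration).
# The project's real SDG keyword lists are not reproduced here.
SDG_KEYWORDS = {
    "SDG 6": ["clean water", "sanitation", "wastewater"],
    "SDG 7": ["renewable energy", "solar", "energy efficiency"],
    "SDG 13": ["climate change", "greenhouse gas", "carbon emissions"],
}


def map_text_to_sdgs(text, keywords=SDG_KEYWORDS):
    """Return a Counter mapping each SDG label to its keyword hit count."""
    lowered = text.lower()
    hits = Counter()
    for sdg, terms in keywords.items():
        for term in terms:
            # Word boundaries keep "solar" from matching inside "solarium".
            pattern = r"\b" + re.escape(term) + r"\b"
            count = len(re.findall(pattern, lowered))
            if count:
                hits[sdg] += count
    return hits


if __name__ == "__main__":
    abstract = "Solar panels can power wastewater treatment plants."
    for sdg, count in sorted(map_text_to_sdgs(abstract).items()):
        print(sdg, count)
```

Counting matches per SDG, rather than recording only presence or absence, leaves room to rank or filter the SDGs for each research group later. Plain keyword matching is only one possible approach and would likely need curated lists and manual review to limit false positives.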

Awards

  • Best Data Science Collaboration Practices

  • Best Data Science Teamwork

  • Highlighted Project

Students

Advisors

Skills required by the team

  • R
  • Python
  • Web Scraping
  • Machine Learning

Final presentation resources