Identification of Sustainability-Related Research at USC through Machine Learning and Keyword Mapping
Description
Are you passionate about data science and sustainability? Then this interdisciplinary project is for you! Here, we will develop a machine learning program to identify USC research publications and grants as ‘sustainability-focused’, ‘sustainability-inclusive’ or ‘not-sustainability-related’ by using pre-categorized publication samples. In addition, we will use keyword lists that relate to the 17 UN Sustainable Development Goals (SDGs) to map all research groups at USC as they relate to these SDGs (https://sdgs.un.org/goals). Lastly, we will create an interactive dashboard in R Shiny that will act as a public directory of all research at USC with classification of the research by the SDGs and broader sustainability categorization. As an example, check our github for USC curriculum: https://github.com/USC-Office-of-Sustainability/USC-SDGmap . Your work on this project is critical in boosting sustainability-related research at USC and thereby achieving our Asgmt: Earth Research Goals."
Awards
Best Data Science Collaboration Practices
Best Data Science Teamwork
Highlighted Project
Students
Advisors
Skills Required by the team
- R
- Python
- Web Scraping
- Machine Learning