Analyzing Open Source Software Ecosystems

Description

Open source runs a lot of the world's critical software systems, but there is much that's unknown in how maintainers, developers and other parts of the software ecosystem function. Help us analyze a large corpus of open source data — both source code and patch conversations — to better understand them! We'll study things like rise to influence, authorship styles, malware analysis, topic modeling and social network analysis!

Students

Advisors

What students will learn

We'll touch on using LLMs to parse text messages and analyze code, graph databases, program analysis, and social network analysis among other skills