Building Inclusive Data

Kathleen emphasizes the importance of mentorship in creating diverse data sets, particularly for underrepresented language communities. She encourages researchers to collaborate with junior researchers from these communities, not only to gather data but also to support their ongoing contributions to analysis and future work. This approach enriches the data, strengthens community ties, and promotes inclusivity in AI development.