Bias in Language Models

Tim and Minqi discuss how pruning a language model to favor human preferences introduces bias and narrows the diversity of the answers it generates. Because the model is shaped by the preference data it is given, the choice of which humans provide that data plays a crucial role in determining its output.