Language Modeling Insights

Connor discusses how models fall back on prior conditioning when faced with weak information. Yannic is surprised by the GPT-3 paper's impressive results without tweaking. They ponder the limits of pushing garbage input before models default to the most probable output.