Thread with 3 posts
jump to expanded postBouta skip the language model and autistically read all the training data into my own brain instead.
Seriously though what the hell are some of these outputs they're teaching these things "he was staring at the beautiful mexican girl" as an "answer" to a random rant.
https://huggingface.co/datasets/cognitivecomputations/dolphin?row=30
Maybe this is a side effect of using AI to generate datasets?
@mauve this looks like parsing out the "Answer the following question:" was the trigger to pick one of "-"-items at the end of the text