SuspciousCarrot78@lemmy.world to Privacy@lemmy.ml · edit-227 days agoI'm tired of LLM bullshitting. So I fixed it.codeberg.orgexternal-linkmessage-square87linkfedilinkarrow-up1503arrow-down132file-textcross-posted to: privacy@lemmy.ml
arrow-up1471arrow-down1external-linkI'm tired of LLM bullshitting. So I fixed it.codeberg.orgSuspciousCarrot78@lemmy.world to Privacy@lemmy.ml · edit-227 days agomessage-square87linkfedilinkfile-textcross-posted to: privacy@lemmy.ml
minus-squareThirdConsul@lemmy.ziplinkfedilinkarrow-up5arrow-down1·4 months agoA very tailored to llms strengths benchmark calls you a liar. https://artificialanalysis.ai/articles/gemini-3-flash-everything-you-need-to-know (A month ago the hallucination rate was ~50-70%)
A very tailored to llms strengths benchmark calls you a liar.
https://artificialanalysis.ai/articles/gemini-3-flash-everything-you-need-to-know (A month ago the hallucination rate was ~50-70%)