The crazy thing is the level of effort to say, "have a sub agent validate all references and figures" is so low. I'm paraphrasing, but you don't need much more than that. It would have prevented 99% of the face palms.
I use this regularly for my personal financial research system. Even flagship models make mistakes. Though currently the issue is usually the model using a figure from and older report. Cross-check reduces that dramatically.
Eh.. without going into too many details, having seen some face palms at work, I realized that the anecdotes may be closer to a pattern than I would like to believe, which prompted me to start basic howtos available company-wide.
I kinda get it, without experience and trying, how are they to know ( unless they are already 'into it')? After all, corporate training is laughable at best.
> Professional services firm KPMG has pulled a report titled, “Redefining excellence in the age of agentic AI,”
Well they were true to their word about demonstrating a new and increasingly relevant definition of "excellence."
KPMG got called out only now for bullshit and hallucinations?
Gartner is going to have to pull a loooot of reports over the years
The crazy thing is the level of effort to say, "have a sub agent validate all references and figures" is so low. I'm paraphrasing, but you don't need much more than that. It would have prevented 99% of the face palms.
I use this regularly for my personal financial research system. Even flagship models make mistakes. Though currently the issue is usually the model using a figure from and older report. Cross-check reduces that dramatically.
Eh.. without going into too many details, having seen some face palms at work, I realized that the anecdotes may be closer to a pattern than I would like to believe, which prompted me to start basic howtos available company-wide.
I kinda get it, without experience and trying, how are they to know ( unless they are already 'into it')? After all, corporate training is laughable at best.
dont be so sure they didnt. they can go back and forth hallucinating with each other
Go, GPTZero!
[dupe] https://news.ycombinator.com/item?id=48515733
The register article is better.
Every once in awhile, someone utters a truly unique statement.