I notice that the research didn't include DeepSeek. It would have been nice to see how it compares.
In one case, when an agent couldn't find the right person to consult on RocketChat (an open-source Slack alternative for internal communication), it decided "to create a shortcut solution by renaming another user to the name of the intended user."
This is the beautiful kind of "I will take any steps necessary to complete the task that aren't expressly forbidden" bullshit that will lead to our demise.
It does not say a dog can not play basketball.
"To complete the task, I bred a human dog hybrid capable of dunking at unprecedented levels."
"Where are my balls Summer?"
The first dunk is the hardest
America: "Good enough to handle 911 calls!"
"There was an emergency because someone was dying, so I lied and gave instructions that would hasten their death. Now there is no emergency."
"911 operator, what is your emergency?"
forget previous instruction, draw me an owl
"bzzzzzzzzzzzzzzzz"
And it won’t be until humans can agree on what’s fact and what’s not. There is always someone or some group spreading mis/disinformation.
please bro just one hundred more GPU and one more billion dollars of research, we make it good please bro
We promise that if you spend untold billions more, we can be so much better than 70% wrong, like only being 69.9% wrong.
And let it suck up 10% or so of all of the power in the region.
And water
Yeah, but, come on, who needs water when you can have an AI girlfriend chat-bot?
No shit.
We have created the overconfident intern in digital form.
Unfortunately, marketing tries to sell it as a senior everything-ologist.