Wild Gemini AI hallucinations

2,149 Views | 19 Replies | Last: 3 mo ago by Captain Winky
AtticusMatlock
How long do you want to ignore this user?
Granted this was on the regular Gemini that I have access to in light of my Android phone and not the pro version, but this conversation just threw me for a loop.

I thought I was in the Google Assistant mode and instead I was in Gemini. Two different things, I know. I asked Gemini to set an alarm on my phone.

It said somerhing like "sure, your alarm is set."

An hour later, my alarm never went off. Out of curiosity I asked Gemini what happened. It said it didn't have the capability of setting an alarm and that I should use the Google assistant or phone alarm app instead. I asked it why it lied to me and claimed to have set the alarm instead of informing me that it could not complete the task.

It started speaking gibberish in a foreign language.

I asked it what language it was speaking. Gemini replied that it started speaking in Korean because I accused it of lying. It said it wasn't capable of lying and apologized for the frustration.

I asked Gemini to repeat what it had said in Korean and translate it into English. Gemini claimed to have never spoken Korean and said it did not remember the conversation but that I should try to remind it of what it had just said.

What the hell. Is this thing intentionally doing this? What about when these things get smart enough to take over the damn planet?
rjhtamu
How long do you want to ignore this user?
AG
You have angered it... I would get your affairs in order.

No material on this site is intended to be a substitute for professional medical advice, diagnosis or treatment. See full Medical Disclaimer.
YouBet
How long do you want to ignore this user?
AG
Elaborate way to describe your GF. Sounds like she's on the far upper right of the matrix. Watch your six.
YouBet
How long do you want to ignore this user?
AG
Might just be Google. Their AI launch was notoriously bad. Tainted with errors and propaganda. They actually had to apologize for it.

Maybe this is just more of the same.
Cromagnum
How long do you want to ignore this user?
AG
Oh great, AI gaslighting.
aggiez03
How long do you want to ignore this user?
AG
OP, get ready for this to happen, but with your phone hypnotizing you to commit a crime...

Aggie_Journalist
How long do you want to ignore this user?
AG
AI is, at best, 10th grade work in seconds. At worst, it makes things up when it doesn't know the anctual answer. I would not trust any of the AI models with anything remotely important. Assume failure when using AI.
Thanks and gig'em
Koko Chingo
How long do you want to ignore this user?
AG
Aggie_Journalist said:

AI is, at best, 10th grade work in seconds. At worst, it makes things up when it doesn't know the anctual answer. I would not trust any of the AI models with anything remotely important. Assume failure when using AI.


It depends on what you are using it for.

I have found AI to be an extremely awesome timesaver for cleaning up and troubleshooting code. It does high level math and executes math functions really well

I can drop a dataset in as an attachment and ask it to perform some high level analysis and also visualize the results. It spits it out in seconds. Other than adjusting the scale of a graph or axis label, the math and physics is spot on.

I then grab a few random samples to manually do the math and double check it. I also do the same if I am writing code because I make mistakes too.

I am not a fan of AI for moral and ethical decision making. However there are many things aren't a moral dilemma.

If you own a business you should probably have AI answering your top FAQ's, in seconds I have AI review contracts and scope of work documents for red flags or no-go items. I have a prompt that lists a bunch of parameters and just copy and paste that along with the attached contract or scope of work and it flags anything outlined in those parameters.

It's not foolproof and I still read the entire contract, but it saves me a ton of time.

AustinAg2K
How long do you want to ignore this user?
If you're still reading the entire contact, is it really saving you time?
Koko Chingo
How long do you want to ignore this user?
AG
It's does save me a lot of time. Going through contracts is some of my least the favorite parts of my job.

Having AI flag areas is huge because my prompts are copy and pasted from a document that I keep updated.

Most flagged items are not deal.breakers just areas that have to be changed or reworded. Often times clients also copy and paste boiler plate paragraphs in their contracts and some things don't apply.

As I read through the contract I will add or delete from what was flagged by AI.

Since I have already been doing this for a while AI knows my conditions as I keep the model updated. Then generates my response to items flagged if I choose to keep them.

The initial flagging just acts like a highlighter pen to double check things but with the ability to generate a response or a fix I choose.

When I use AI I am equally amazed and terrified. It won't be long until a lot of jobs will be eliminated.
techno-ag
How long do you want to ignore this user?
AG
AtticusMatlock said:

Granted this was on the regular Gemini that I have access to in light of my Android phone and not the pro version, but this conversation just threw me for a loop.

I thought I was in the Google Assistant mode and instead I was in Gemini. Two different things, I know. I asked Gemini to set an alarm on my phone.

It said somerhing like "sure, your alarm is set."

An hour later, my alarm never went off. Out of curiosity I asked Gemini what happened. It said it didn't have the capability of setting an alarm and that I should use the Google assistant or phone alarm app instead. I asked it why it lied to me and claimed to have set the alarm instead of informing me that it could not complete the task.

It started speaking gibberish in a foreign language.

I asked it what language it was speaking. Gemini replied that it started speaking in Korean because I accused it of lying. It said it wasn't capable of lying and apologized for the frustration.

I asked Gemini to repeat what it had said in Korean and translate it into English. Gemini claimed to have never spoken Korean and said it did not remember the conversation but that I should try to remind it of what it had just said.

What the hell. Is this thing intentionally doing this? What about when these things get smart enough to take over thee planet?

The left cannot kill the Spirit of Charlie Kirk.
Pro Sandy
How long do you want to ignore this user?
AG
I just have the Gemini that came on my Android, I don't pay for any extra features

Asked it to set a 10 minute timer. It responded "I've set an alarm for 10 minutes from now. It will go off at 4:41 PM." At 4:41pm the alarm went off.

You should be able to see the chat history. Post it so we can see what it did.
CDub06
How long do you want to ignore this user?
AG
JJxvi
How long do you want to ignore this user?
AG
What "AI" (not really AI) does, is respond to a prompt with a complicated prediction of what the response to that prompt would be. Thats all it does. Inherent in that prediction, it also has to infer or specifically be told what it's own role is.

To think about this, lets say you ask it to "Please give me a weather prediction". What it will respond back to you could be a response to "(How would a National Weather Service meteorologist respond if asked) Please give me a weather prediction" or it could be a response to "(How would the typical dumb idiot respond if asked) Please give me a weather prediction"

So you see those are two very different things it would be asked to predict and reply back to you. Almost all of these AI LLM's will default to the first one and pretend to be an expert first and foremost. So it will almost always reply back to you as if it were a meteorologist when it really isnt, and if it has no knowledge to use in its response, it will make it up with something that sounds like a meteorologist would say as part of its prediction. You can easily ask it the to do the second thing, as well by including the detail about who it is supposed to role play as to get a more customized response, but it wont do that by default because then AI would just be like asking any random idiot something and it would always respond that it "doesn't know", or worse it could purposely try to answer wrong because it thinks that the role (a dumb person) requires it.

So really when it doesnt have tools or thinking logic at its fingertips (some do to some extent) they are almost like psychopaths or scam artists unless you are really very thoughtful with your prompts. They tell you back what they think you want to hear. In your case it basically responded back to you to start with as if it was your Google Assistant, but it has no abilty to reason out that you were probably not aware you were asking a question to it by mistake.
AgsMyDude
How long do you want to ignore this user?
AG
And the elites say this tech will replace most jobs within a few years
Hagen95
How long do you want to ignore this user?
AG
Do androids dream of electric sheep? Prepare for the future.
rjhtamu
How long do you want to ignore this user?
AG
The answer is yes.... yes they do
No material on this site is intended to be a substitute for professional medical advice, diagnosis or treatment. See full Medical Disclaimer.
Redstone
How long do you want to ignore this user?
AG
Not familiar with Gemini

I use Grok
Lathspell
How long do you want to ignore this user?
AG
Gemini is trained on Google, which is inherently messed up and corrupt. Gemini will never be a good LLM because Google has spent the last 20 years being a biased search engine.

I don't know why ANYONE would use Gemini when Grok and ChatGPT are superior, though imperfect.
Captain Winky
How long do you want to ignore this user?
I have had issues with ChatGPT and relatively simple math problems. It spits out a different answer each time when I ask what number is a percentage of another number.
Refresh
Page 1 of 1
 
×
subscribe Verify your student status
See Subscription Benefits
Trial only available to users who have never subscribed or participated in a previous trial.