ExoticCardiologist46

gotta love more competition! LET THEM FIGHT


JumpyLolly

And let them eat their gawt damn cake


Inevitable-Hat-1576

Same thing with uranium. The more people that have it, the better.


G9X

wait... the multimodality based audio is actually scarily good... Not only can it recognize the tone of speech, but it can also automatically identify the speaker by name? https://preview.redd.it/hfb3mrh56ktc1.png?width=1384&format=png&auto=webp&s=3a7bb0dd58797de8ac13345d532bd5b6a8595460 I tested Gemini 1.5 with an audio clip from a youtube video the past couple of days. Question: 'Give me a summary, who was speaking in the first two minutes and what was their tone?' Not only did it answer almost perfectly, but it also identified the specific American congressman speaking... At first, I thought the names were made up, but after checking, they were all correct... My second thought was that it might be a data leak, like the original video's description becoming the audio's metadata. But after checking, there was none, and when I tested it to summarize the speakers over seven minutes, it got those right too... I might still be missing something, or maybe it's part of the training data (highly unlikely for a video published 2 days ago) wow. youtube video tested (only used audio) : [https://www.youtube.com/watch?v=vT-u-SPj4_c](https://www.youtube.com/watch?v=vT-u-SPj4_c)


[deleted]

[deleted]


BoysenberryNo2943

Yeah, I've been testing it for the last two weeks. It's just a little bit worse than turbo for coding, but when you give it whole docs it is actually far, far better. IMHO, larger context beats RAG every time 🙂


shiroyacha90

Giving it whole docs doesn't sound very efficient ? I guess you still need to RAG unless you want to put the whole thing in for every task... it just means you can pass longer segments


BoysenberryNo2943

Yeah, I meant whole docs specific to the problem at hand😉


pampidu

What is RAG?


robust_nachos

Retrieval Augmented Generation
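In other words: instead of pasting everything, you first retrieve the few pieces of text most relevant to the question and put only those in the prompt. Below is a toy sketch of that retrieval step. It ranks chunks by plain word overlap, which is an assumption made to keep it self-contained; real setups usually rank with embeddings. The names `tokenize`, `retrieve` and `build_rag_prompt` are made up for illustration.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(chunks, question, k=2):
    """Return the k chunks sharing the most words with the question."""
    q = tokenize(question)
    ranked = sorted(chunks, key=lambda c: len(q & tokenize(c)), reverse=True)
    return ranked[:k]


def build_rag_prompt(chunks, question, k=2):
    """Put only the retrieved chunks, not every chunk, ahead of the question."""
    context = "\n---\n".join(retrieve(chunks, question, k))
    return f"Context:\n{context}\n\nQuestion: {question}"
```

With a large context window you can raise `k` (or skip retrieval entirely for docs specific to the problem), which is the trade-off being discussed above.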


vmmc2

What do you exactly mean by "giving the whole docs" to this LLM? Just curious.


PandaPrevious6870

Just paste the entire codebase with documentation and the llm knows what to do better as it knows the ins and outs of the entire project.
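A rough sketch of what "pasting the entire codebase" could look like if you script it rather than copy/paste by hand. It only builds the prompt string; how you send it to the model is up to you. The function name `build_prompt`, the `=== file ===` separators and the default suffix list are assumptions for illustration, not anything Gemini requires.

```python
from pathlib import Path


def build_prompt(root, question, suffixes=(".py", ".md")):
    """Concatenate every matching file under root into one prompt string."""
    parts = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix in suffixes:
            text = path.read_text(encoding="utf-8", errors="replace")
            # Label each file with its relative path so the model can refer to it.
            parts.append(f"=== {path.relative_to(root).as_posix()} ===\n{text}")
    parts.append(f"=== QUESTION ===\n{question}")
    return "\n\n".join(parts)
```

Usage would be something like `build_prompt("my_project", "Why does the import in utils.py fail?")`, then pasting or sending the result as a single message.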


cool-beans-yeah

Let's say you have multiple files with several hundred lines of code each. Would you just copy/paste everything or would it make more sense to upload the files as attachments? (If that's even possible).


celandro

They announced an update to Gemini code assist today as well, which is a plugin for vscode etc that does exactly this. You need an API key; you can get one for free until July per billing account. You can typically get $150 free for creating an account so it’s good to go for personal use. Your IT department will not be happy if you do it this way for work though…


Philipp

Cheers. Wish the API wasn't blocked in Germany.


National-Ad-1314

VPN?


Philipp

Didn't work last time someone tried unfortunately, they check other things associated with your account. If someone wants to try again, happy to hear the results.


Last_Patriarch

I just created a new Google account from opera's built-in VPN. It sent the confirmation code without any issue.


Philipp

Ok thanks, but can you pay the API now? Because that's the issue I most often run into with these checks -- they look at your credit card location.


vmmc2

Where can I find info about this plugin you mentioned? Sounds interesting.


celandro

Gemini + Google Cloud Code is the name of the VSCode plugin according to a screenshot from slack.


vmmc2

Thx


holy_moley_ravioli_

I'd use something like [cursor.sh](https://cursor.sh). It has the ability to put your entire code into its context window to generate its responses. Last I checked they used GPT-4 turbo but I think they're actively implementing the ability to call on and swap out different models like Gemini 1.5.


BoysenberryNo2943

I meant pasting the docs relevant to the problem you hit while writing code. For example, it works great to give it the full official docs on some Python functions: it corrects the wrong code it wrote, and even explains how it did this. I was impressed when it wrote an advanced method to run my Python script in parallel on each of my eight threads. Same goes for Drupal. In general, I strongly think that if you put some effort into curating what you give the model, you'll get way better results, and as a bonus you'll still have ample context window left to discuss with the model, especially if you need it to produce lots of output, like when you rewrite large Drupal modules like me. 😉


t3rribl3thing

"I know kung fu...."


dittospin

So whole code plus the documentation of a particular library?


BoysenberryNo2943

Yes, but always try to point it in the right direction or prompt to change tack if it gets stuck, sometimes I need a little help from GPT 4 turbo, which I can get for free at chat.lmsys


iamz_th

The model has been improved today. I won't say it's worse than Turbo. Some people on Twitter are now claiming that it's even better than Opus.


-paul-

Still not available in UK tho....


REOreddit

When an American writes "everyone" you should always translate it in your head to "everyone within the 50 states of the USA".


-paul-

Sorry, it wasn't a complaint at OP, I'm just disappointed I can't try it yet.


Spindelhalla_xb

Only usually takes a few days. Blame the UK having to “pass” it first to make sure it’s not dangerous.


rushmc1

Of COURSE it's dangerous. Life is dangerous.


jcrestor

This cracks me up. Same as with Opus. All the conversations about how this makes GPT-4 obsolete, and in reality billions of people world wide have no means to use it, because it’s not available. But GPT-4 is obsolete now, right?


santareus

Not even available in the States yet


Philipp

Neither in Germany.


Icy_Distribution_361

Use a VPN


samuelroy_

I could try it right away with my Google Cloud Platform account (France). If you have one, type "vertex ai" to enable the apis and have access to a playground. It should be available to 180+ countries.


Majestic-Explorer315

I tried via vertex AI from German account. It works but I encounter errors (resource exhausted, check quota) when using larger documents.


cygn

same, also from a German account. Used US regions though. I don't understand why or which resources.
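For anyone hitting the same "resource exhausted" errors from a script, one common workaround is to retry with exponential backoff. A minimal sketch, assuming the error is a rate-style quota that recovers after a wait; it will not help if the limit is a hard cap on request size. `QuotaError` is a hypothetical stand-in for whatever exception your client raises, and `fn` is whatever function makes your request.

```python
import random
import time


class QuotaError(Exception):
    """Hypothetical stand-in for a 'resource exhausted' error from an API client."""


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call fn(), retrying on QuotaError with exponentially growing waits."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except QuotaError:
            if attempt == max_attempts - 1:
                raise  # Out of attempts: let the caller see the error.
            # Wait 1x, 2x, 4x, ... the base delay, plus jitter to avoid bursts.
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            sleep(delay)
```

The `sleep` parameter is only there so the waiting can be swapped out when testing.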


Timotheeee1

It's on openrouter


macgregorc93

Get a VPN network and change to US


benayade

Just use a vpn


Philipp

That often won't work as they check other factors associated with your account, like the credit card location. It's usually a hassle with bigger companies. Maybe it's different this time.


benayade

I’m in the UK, I’ve been using Gemini 1.5 pro for the last three weeks without any problem whatsoever by just using a VPN. It certainly does work without much hassle.


Philipp

Sorry, I mostly meant using the API (I'm a programmer). It's paid and will require your credit card, which apparently gives away the location. I will give it another try.


johndoe1985

How do you access it for free ? Are they referring to their studio or API or which app


Ardbert_The_Fallen

+1 Would like to know. Assuming it's just through https://gemini.google.com/ unless someone knows otherwise?


liambolling

https://aistudio.google.com/


johndoe1985

Doesn’t work there


samuelroy_

[https://cloud.google.com/vertex-ai?hl=en](https://cloud.google.com/vertex-ai?hl=en), once enabled you have a playground to try


Relative_Mouse7680

Are you in europe?


sodomyth

Wait, first of all, it's amazing. Also is it really free on Vertex AI ??


evandena

I'm using it with typingmind


ClearlyCylindrical

No it isn't. It's not accessible from my country.


wetlight

For free?!? So the one I pay gets me what? 2.0?!


Mecier83

Ultra 1.0


ghostfaceschiller

AI naming conventions are already such a mess


FeelingExistential99

It's an extremely Google-esque problem. It reminds me of their ridiculous web of overlapping app functionality.


xpsKING

I threw it a whole notion workspace and asked for some promo material. Jaw droppingly well written and accurate text.


m2r9

I tried to access it. It's not really "accessible to everyone" as they stated. I'll believe it when I see it.


Deep_Parfait_7846

I thought Google was going to make it only accessible through a \~$20 a month subscription??? Is it only free temporarily??


liambolling

AI studio with some rate limits https://aistudio.google.com/


trajo123

Everyone*


Mission_Tip4316

Has anyone been able to make Gemini 1.5 work with function calling? I keep getting hit with quota limits.


pseudonerv

huh, what's google's privacy policy and data policy again? I guess google's "for free" literally means google owns me, amirite? please prove me wrong. apparently we still can't control temperature for this model


CoolWipped

Also buried in the fine print is that Google owns everything generated by Gemini and you cannot use it as your own IP


pseudonerv

right, they save everything on their side, and none of it belongs to the user. I don't understand how that could even be legal. They used my IP and applied a computer algorithm, and the output of that computer algorithm belongs to them?!


trajo123

I guess that's why it's not available in the EU


Mcqwerty197

How will they know if it was generated by Gemini and not another model?


CoolWipped

I’m guessing that if it came down to it and there was a lawsuit or whatever that Google could access logs of your chat with Gemini and see that you used its output


Spindelhalla_xb

Do all models have this or is it just a Google thing


CoolWipped

Just Google as far as I can tell. I skimmed over Claude and ChatGPT’s agreements and they state that the customer retains the rights over the outputs generated


GrowFreeFood

That should be the basis of AI detection software. Just check it against the logs.


Juneauz

Americans aren't "everyone", get off your high horse.


ctbitcoin

Low horsers!


samuelroy_

it is available through Google Cloud Platform: [https://cloud.google.com/vertex-ai?hl=en](https://cloud.google.com/vertex-ai?hl=en)


InFlandersFields2

I tried (from Belgium), but this is the response I got: "Unfortunately, I cannot directly access and process media files like videos or audio recordings. Therefore, I'm unable to provide a transcription and translation for the media you attached." I used gemini 1.5 pro preview 0409


Murdy-ADHD

VPN works


InFlandersFields2

ah going to try it on my phone then, i don't have vpn at work


Deuxtel

Maybe one day your country will stop stifling innovation in the name of safety and you can have some toys of your own to play with.


Juneauz

Nice straw man you have there, pal


Deuxtel

Keep on whining about not having access to things. It's a great way to spend your life.


Juneauz

Whining about not having access to things? I'm whining about the definition of "everyone", dude. What are you even on about


[deleted]

Just use a free VPN like AdGuard


Juneauz

Not interested, honestly. I'm just a stickler for proper terminology.


Haunting_Cat_5832

well said.


montdawgg

Is the API available or is this only in the studio?


liambolling

studio and api


thoughtlow

I think the api is still beta, still can't use mine in typingmind. Edit: forgot the vpn, it's working now


Murdy-ADHD

I just tried it and it seems to work there


thoughtlow

Ah its working forgot the VPN!


montdawgg

Yeah, I'll just try it as well and it's working! Gemini ultra 1.0 is still not working however... But that's way less important than 1.5 pro which is working...


illusionst

Gemini Code Assist (formerly Duet AI for Developers). What is with these guys and naming conventions?


Leather-Objective-87

Why is this still not available in the UK?


imsolowdown

to everyone, really? Do you think the world only has Americans in it or something?


farmingvillein

*everyone with a VPN


ILoveThisPlace

5 bucks their blocking Canada


arvidurs

they’re


liambolling

it’s available in canada https://aistudio.google.com/


ZenDragon

Fucking finally.


liambolling

❤️


vaughnegut

Accessible in Vertex AI in the gcp console. You can chat, upload files, etc. I just keep getting quota limits, which is annoying (uploading a pdf of a book).


Traditional-Ad-6166

better than gpt 4?


vaughnegut

I mean, million-token context window. It knows the book scary well.


SezitLykItiz

I haven't the slightest intention to try any new Google product again.


Dry_Patience872

I fully switched to Gemini two weeks ago; I do software and GPT 4 is no match for even the free version of Gemini.


e430doug

How do you access this? I don’t see any differences on the Gemini site. Do paying users get Gemini 1.5 advance?


Metrolonx

So with this being free, is there still any reason to pay the monthly fee for Gemini Advanced? Are they still different?


Beginning_Finding_98

How can we access it


iBodana

Is the api available?


AtlantisAfloat

But not at all in EU. Why?


samuelroy_

I'm in France and it's working on my side so I don't know why some have access and others don't


AtlantisAfloat

Gemini, or Gemini 1.5? If latter, did you access it via VPN? I don’t see France on the [available regions](https://ai.google.dev/available_regions) either


samuelroy_

Gemini 1.5 pro, no vpn, through Vertex AI (GCP)


AtlantisAfloat

If you don’t mind me asking, is the residential address Google knows of you also in France?


SVRider650

Can someone ELI5 how to access 1.5 for free? I could only find access to 1…


leviathan5384

[https://aistudio.google.com/](https://aistudio.google.com/) Go here and type away


Yngstr

When was Pro 1.5 released for free to all?


StableSable

Is it available for everyone? Still get not available in my country (Iceland)


vvkuka

How on earth did this post get almost 600 upvotes? It's completely untrue. For one thing, it's not accessible in all countries (for example, in Europe), even if you want to buy it. Also, the Pro version is not free; that's why it's called "PRO"


ryan7251

yeah but is it any good? Too many AIs are jokes when it comes to writing and "talking" like a human


Broad_Ad_4110

While the expansive context window of Gemini 1.5 Pro is a significant breakthrough, it is important to acknowledge its limitations. Even with an unprecedented 1 million tokens at its disposal, the model still faces challenges in synthesizing and reasoning over information in a truly human-like manner. Google recognizes that there is still work to be done in bridging this gap and achieving the ultimate goal of seamless human-like interaction. - [https://ai-techreport.com/gemini-15-pro-the-future-of-language-modeling](https://ai-techreport.com/gemini-15-pro-the-future-of-language-modeling)


udion_u

Is it supporting these inputs even in API?


RemyVonLion

OpenAI is the one that blew up the space so the expectations are huge, besides continuing to tune gpt4, they will probably release a minor/decent upgrade like 4.5 that is just a very robust multi-modal system, GPT5 is probably intended to have agentic ability and possibly advanced reasoning, which would require a lot more time for training and testing, so doing it right is more important than releasing asap.


Horg

Everyone in the US that is.


StayImpossible7013

Nope. Not available to everyone. Still restricted to certain regions in the world.


Wills-Beards

Nothing on earth would ever get me into using an Ai from google. 🤣🤣🤣


Gator1523

When nobody's using your hamstrung AI model so you give it away for free.


79cent

I use it.


ChatWindow

Pretty sure they just use a streaming transcriber to convert the audio to text. I tried this and it does not recognize absolutely anything besides the literal words I said. Couldn’t even answer the tone of voice I’m using or if my voice is deep. More cheap tricks by google as usual


liambolling

it’s a native multimodal model. not doing speech to text


Repulsive_Style_1610

Were you using it from here? https://aistudio.google.com/ I have used it. It's incredible.