Firefox introduces AI as experimental feature

potentiallynotfelix@lemmy.fish · 9 months ago

davel [he/him]@lemmy.ml · 9 months ago

Leaflet@lemmy.world · 9 months ago

That was there before 133, don’t remember the exact release that added it.

nu11@sh.itjust.works · 9 months ago

I don’t understand the hate. It’s just a sidebar for the supported LLMs. Maybe I’m misunderstanding?

Yes, I would prefer Mozilla focus on the browser, but to me, this seems like it was done in an afternoon.

PrefersAwkward@lemmy.world · edit-2 9 months ago

It seems like common cynicism. Mozilla adds this feature so as not to cede major features to other browsers. Mozilla’s version lets you natively pick from lots of different AI solutions.

Not every feature is for everyone. Not every feature is done being improved on at release.

And in spite of popular opinions, organizations don’t do just one thing and then do just the next thing and the thing after that. Organizations can and do focus on and prioritize many things at the same time.

And for people who are naysaying AI at every mention, it has a lot of great and fascinating uses, and if you think otherwise, you really should try them more. I’ve used it plenty for work and life. It’s not going away, might as well do some nice things with it.

Scrollone@feddit.it · 9 months ago

I want my browser to be a browser. I don’t want Pocket, I don’t want AI, I don’t want bullshit. There are plugins for that.

ToxicWaste@lemm.ee · 9 months ago

that’s the great thing: you don’t have to use it

Scrollone@feddit.it · 9 months ago

But Firefox wastes time developing that instead of fixing 20-year-old bugs.

ToxicWaste@lemm.ee · 9 months ago

i know it is an unpopular opinion around here. but currently AI features open doors for sales. that is important.

for the software i help develop, we introduced an optional AI integration. just its presence allowed us to sell the main SW multiple times. the AI plugin was never sold so far.

investment AI: 2 weeks of gluecode. i am not concerned with finances, but that plugin is for sure net positive.

ToxicWaste@lemm.ee · 9 months ago

right now we don’t have any real customers that use it - as the plugin did not sell yet.

but from testing at customer sites with real people that would use it - we got only positive feedback. which is not hard to imagine: the RAG + LLM enables less experienced users to navigate a huge and complex network of information.

but it for sure is also a buzzword execs like to see: they talked to us because we have AI. saw that the main product is good. bought the main product and decided the AI is too expensive.

in the end it doesn’t matter to me. the 2w of AI was a fun sidequest and it left us with a passive boost for sales.

JokeDeity@lemm.ee · 9 months ago

Unpopular opinion: I think they’re doing it about as well as it can be done, at least. It’s completely optional and doesn’t seem to be intrusive.

potentiallynotfelix@lemmy.fish · 9 months ago

yeah it’s not google chrome level, which i’m thankful for.

JokeDeity@lemm.ee · 9 months ago

I’m way more pissed about restarting my PC after an update and having Copilot installed without my permission.

potentiallynotfelix@lemmy.fish · 9 months ago

Here’s the solution for you

Read Bio@lemm.ee · 9 months ago

I agree

cmnybo@discuss.tchncs.de · 9 months ago

They better not decide to enable it by default.

metaStatic@kbin.earth · 9 months ago

it’s not enabled by default … it’s opt out by default

Vincent@feddit.nl · edit-2 9 months ago

I think that means that it’s opt-in.

adarza@lemmy.ca · 9 months ago

if third-party accounts are needed, it’ll have to stay that way.

Sir Arthur V Quackington@lemmy.world · 9 months ago

Thing is, for your average user with no GPU and who never thinks about RAM, running a local LLM is intimidating. But it shouldn’t be. Any system with an integrated GPU, and the more RAM the better, can run simple models locally.

The not so dirty secret is that ChatGPT 3 vs 4 isn’t that big a difference, and neither is leaps and bounds ahead of the publicly available models for about 99% of tasks. For that 1% people will ooh and aah over it, but 99% of use cases are only seeing marginal gains on 4o.

And the simplified models that run “only” 95% as well? They can use 90% fewer resources and give pretty much identical answers outside of hyperspecific use cases.

Running a “smol” model, as some are called, gets you all the bang for none of the buck, and your data stays on your system and never leaves.

I’ve been yelling from the rooftops to some stupid corporate types that once the model is trained, it’s trained. Unless you are training models yourself, there is no need for the massive AI clusters, just for the model. Run it local on your hardware at a fraction of the cost.

MrOtherGuy@lemmy.world · 9 months ago

I’m guessing that the reason (and a good one at that) is that simply having an option to connect to a local chatbot leads to just confused users because they also need the actual chatbot running on their system. If you can set up that, then you can certainly toggle a simple switch in about:config to show the option.

ilhamagh@lemmy.world · 9 months ago

Can you point me to some resources for running a smol LLM?

My use case is probably just help with “typing up” miscellaneous ideas I have, or checking for grammatical errors in English.

Thanks in advance.

ddh@lemmy.sdf.org · 9 months ago

https://ollama.com/

Sir Arthur V Quackington@lemmy.world · 9 months ago

Here you go: Review of SmolVLM https://www.marktechpost.com/2024/11/26/hugging-face-releases-smolvlm-a-2b-parameter-vision-language-model-for-on-device-inference/

Model itself: https://huggingface.co/spaces/HuggingFaceTB/SmolVLM

And you can use Ollama to run it locally, and Open WebUI to access it in browser.
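
For the grammar-checking use case asked about above, a minimal sketch of talking to a locally running Ollama server from Python might look like the following. It assumes Ollama is listening on its default port (11434) and that a model tagged `llama3.2` has already been pulled; the helper names and the prompt wording are made up for illustration.

```python
import json
import urllib.request

# Assumption: Ollama's default local endpoint; adjust if yours differs.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(text, model="llama3.2"):
    """Build the JSON body for a one-shot, non-streaming generate request."""
    prompt = (
        "Correct the grammar and spelling of the following English text. "
        "Reply with the corrected text only.\n\n" + text
    )
    return {"model": model, "prompt": prompt, "stream": False}


def check_grammar(text, model="llama3.2", url=OLLAMA_URL):
    """Send the text to the local model and return its reply as a string."""
    body = json.dumps(build_payload(text, model)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


if __name__ == "__main__":
    # Requires a running Ollama server; the text never leaves your machine.
    print(check_grammar("me and him goes to the store yesterday"))
```

Swapping the `model` argument for a smaller model tag should be enough to try the same thing on weaker hardware.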

sinceasdf@lemmy.world · 9 months ago

Idk I noticed pretty significant differences between models of various sizes. I mean there are lots of metrics on this

https://www.vellum.ai/llm-leaderboard

Lojcs@lemm.ee · 9 months ago

Last time I tried using a local llm (about a year ago) it generated only a couple words per second and the answers were barely relevant. Also I don’t see how a local llm can fulfill the glorified search engine role that people use llms for.

Sir Arthur V Quackington@lemmy.world · 9 months ago

Try again. Simplified models take the large ones and pare them down in terms of memory requirements, and can be run off the CPU even. The “smol” model I mentioned is real, and hyperfast.

Llama 3.2 is pretty solid as well.
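
As a rough back-of-the-envelope illustration of why pared-down models fit on modest hardware: the comment above doesn't say how the models are shrunk, but storing each weight in fewer bits (quantization) is one common way, and it is used here purely as an example. The sketch counts only the weights and ignores everything else a running model needs, so treat the numbers as a lower bound.

```python
def weights_memory_gb(num_parameters, bits_per_weight):
    """Approximate memory for the weights alone, in gigabytes (10**9 bytes)."""
    return num_parameters * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Assumption for illustration: a 2B-parameter model, the size SmolVLM
    # is described as in the linked review.
    params = 2_000_000_000
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: ~{weights_memory_gb(params, bits):.1f} GB")
```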

Lojcs@lemm.ee · edit-2 9 months ago

These are the answers they gave the first time.

Qwencoder is persistent after 6 rerolls.

Anyways, how do I make these use my gpu? ollama logs say the model will fit into vram / offloading all layers, but gpu usage doesn’t change and cpu gets the load. And regardless of the model size, vram usage never changes and ram only goes up by a couple hundred megabytes. Any advice? (Linux / Nvidia) Edit: it didn’t have cuda enabled apparently, fixed now
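
For anyone hitting the same symptom, one way to check where a loaded model actually sits is to ask the local Ollama server which models are running and how much of each is in VRAM. This is a sketch under assumptions: it relies on the server's `/api/ps` endpoint on the default port, and on the `size` and `size_vram` fields as I recall them from Ollama's API docs, so verify both against your installed version. The function names are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama's default local address.
OLLAMA_PS_URL = "http://localhost:11434/api/ps"


def gpu_fraction(model_info):
    """Return the share (0.0 to 1.0) of a loaded model that sits in VRAM."""
    total = model_info.get("size", 0)
    in_vram = model_info.get("size_vram", 0)
    return in_vram / total if total else 0.0


def describe(ps_response):
    """Turn a parsed /api/ps response into one readable line per model."""
    lines = []
    for model in ps_response.get("models", []):
        share = gpu_fraction(model)
        lines.append(f"{model.get('name', '?')}: {share:.0%} in VRAM")
    return lines


if __name__ == "__main__":
    # Requires a running Ollama server with at least one model loaded.
    with urllib.request.urlopen(OLLAMA_PS_URL) as response:
        for line in describe(json.loads(response.read())):
            print(line)
```

A model reported as 0% in VRAM while the CPU is busy would match the behaviour described here.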

Sir Arthur V Quackington@lemmy.world · 9 months ago

Nice.

Yea I don’t trust any AI models for facts, period. They all just lie. Confidently. The smol model there at least tried and got it right at first… Before confusing the sentence context.

Qwen is a good model too. But if you wanted something to run home automation or do text summaries, smol is solid enough. I’m using CPU so it’s good enough.

TheDorkfromYork@lemm.ee · 9 months ago

They’re fast and high quality now. ChatGPT is the best, but local llms are great, even with 10gb of vram.

fibojoly@sh.itjust.works · 9 months ago

Didn’t want it in Opera, don’t want it in Firefox. I mean they can keep trying and I’ll just keep on ignoring this shit :/

davi@startrek.website · 9 months ago

hopefully, it’ll be possible to opt out somehow.

ToxicWaste@lemm.ee · 9 months ago

as the screenshot shows, it is opt-in

Eiri@lemmy.ca · 9 months ago

I wish I had telemetry on such features.

I really doubt a significant number of people use AI chatbots often enough that having it in a dedicated sidebar is worth it.

Possibly linux@lemmy.zip · 9 months ago

I wish I had telemetry

I’m sure they do as Mozilla is an ad company

davel [he/him]@lemmy.ml · 9 months ago

This is apparently either not widely known or some people just like to shoot the messenger.

jwz, Jun.: Mozilla is an advertising company now
jwz, Oct.: Mozilla’s CEO doubles down on them being an advertising company now
Mozilla support: Share data with Mozilla to help improve Firefox
Firefox documentation: Telemetry

Possibly linux@lemmy.zip · 9 months ago

While you are not wrong, your dislike of Mozilla has more to do with your instance being anti west. I’m not sure I’m ready to side with lemmy.ml

davel [he/him]@lemmy.ml · edit-2 9 months ago

I happen to know jwz personally, and he knows Mozilla intimately: he founded it. His dislike of Mozilla is pretty much the same as mine, and he is neither “anti west” nor anti liberal. We dislike Mozilla because it has lost its way from being a FOSS browser maintainer and a booster for & steward of an open web.

And I’m not “anti-West,” I’m anti-capitalist, anti-settler-colonialist, and anti-imperialist; and those happen to be things that “the West” presently embodies.

davel [he/him]@lemmy.ml · 9 months ago

I’m pretty sure I know what the instance I admin does & doesn’t endorse, thanks.

Scrollone@feddit.it · 9 months ago

I think nobody uses AI Chatbots, unless you’re forced to do it. They’re utter shit.

treadful@lemmy.zip · 9 months ago

I’ve never had the urge to use a chat bot personally, but I’m pretty sure I’m in the minority. Lots of people use these things all the time for so much stuff we probably wouldn’t even consider.

I’ve worked with a few people that all but rely on these things to produce any creative work they have to do.

Maybe we run in different circles but I think a lot of people don’t even talk about how they’re using it.

eleitl@lemm.ee · 9 months ago

Thanks for nothing, Mozilla.

Rozaŭtuno@lemmy.blahaj.zone · 9 months ago

They should raise the ceo’s pay some more to celebrate.

piracysails@lemm.ee · edit-2 9 months ago

And fire a few employees just cause.

marcie (she/her)@lemmy.ml · edit-2 9 months ago

why a fucking chatbot? translate a page better for me you fucking losers, all the translation options suck for privacy outside of specifically trained local AIs. this is the BEST use case for a small local LLM yet mozilla with all its brains and resources couldnt rub two neurons together for this.

or they could do character prediction on your typing to make typing faster. just some legit examples, why waste resources to build a chat ai into my browser when i can just open a website???

Midnitte@beehaw.org · 9 months ago

Perhaps Mozilla’s biggest “failure” is just communication…

Firefox actually has this now.

marcie (she/her)@lemmy.ml · 9 months ago

bergamot is ok but leaves a lot to be desired

thingsiplay@beehaw.org · 9 months ago

https://support.mozilla.org/en-US/kb/ai-chatbot

Note that you need an account to use one of these supported systems. HuggingChat allows a few connections as a guest before cutting off access; it’s basically a trial version, so you have to create an account.

ohwhatfollyisman@lemmy.world · 9 months ago

as someone who’s never dabbled with ai bots, what does this feature do? is it only to query for information like a web search?

Furball@sh.itjust.works · 9 months ago

It just adds ChatGPT or similar to your sidebar. Chatbots can do a lot of things, they are mostly good for information research and technical help, although they have serious flaws like hallucinating false information sometimes

Pup Biru@aussie.zone · 9 months ago

good for information research and technical help

i’d say they are good precursors for information research… never trust them, but use them to find terms to search for reliable sources

TheMachineStops@discuss.tchncs.de · edit-2 9 months ago

It gives you many options for what to use; you can use Llama, which runs offline. That needs to be enabled, though, via about:config > browser.ml.chat.hideLocalhost.
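
If you'd rather not click through about:config, the same pref can be set from a `user.js` file in the Firefox profile directory, which Firefox reads at startup. The sketch below only generates and appends the line; the helper names are hypothetical, and the assumption that `false` is the value that reveals the localhost option is inferred from the pref's name, so double-check it in about:config.

```python
from pathlib import Path


def user_pref_line(name, value):
    """Render one user.js line; supports bool, int and string values."""
    if isinstance(value, bool):
        rendered = "true" if value else "false"
    elif isinstance(value, int):
        rendered = str(value)
    else:
        escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
        rendered = '"' + escaped + '"'
    return f'user_pref("{name}", {rendered});'


def append_pref(profile_dir, name, value):
    """Append the pref to user.js in the given profile directory."""
    user_js = Path(profile_dir) / "user.js"
    with user_js.open("a", encoding="utf-8") as handle:
        handle.write(user_pref_line(name, value) + "\n")
    return user_js


if __name__ == "__main__":
    # Assumption: False un-hides the localhost provider option.
    print(user_pref_line("browser.ml.chat.hideLocalhost", False))
```

To actually write the file, call `append_pref` with the path to your own profile directory (listed on the about:profiles page) and restart Firefox.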

Swedneck@discuss.tchncs.de · 9 months ago

and thus is unavailable to anyone who isn’t a power user, as they will never see a comment like this and about:config would fill them with dread

TheMachineStops@discuss.tchncs.de · edit-2 9 months ago

Lol, that is certainly true and you would need to also set it up manually which even power users might not be able to do. Thankfully there is an easy to follow guide here: https://ai-guide.future.mozilla.org/content/running-llms-locally/.

Ephera@lemmy.ml · 9 months ago

From the description in the UI, it does sound like it. Theoretically, a chatbot could be created where you can ask questions about the webpage you currently have open, for example if you don’t want to read a long article. I guess you could probably just throw a link into an existing chatbot anyway, but yeah, direct integration might be convenient.

Well, or a chatbot could be created, which has access to your browser history, bookmarks and tabs, so you can ask it when you last saw certain information. However, you’d need a locally running chatbot for that, which makes it more difficult to implement.

festnt@sh.itjust.works · 9 months ago

good question

Sundial@lemm.ee · 9 months ago

Are any of these open source or trustworthy?

1rre@discuss.tchncs.de · 9 months ago

I think Mistral is model-available (ie I’m not sure if they release training data/code but they do release model shape and weights), huggingchat definitely is open source and model-available

thingsiplay@beehaw.org · edit-2 9 months ago

~~Sorry but HuggingChat / HuggingFace and all models on it are not open source~~ (Edit: Oh you meant the UI HuggingChat is Open Source. Yeah sorry, I was focused on the models. And there is no Open Source model from my understanding.) -> https://opensource.org/ai/open-source-ai-definition Of course opensource.org is not the only authority on what the word opensource means, but it’s not a bad start.

thingsiplay@beehaw.org · 9 months ago

There are no open source ai models, even if they tell you that they are. HuggingFace is the closest thing to open source, where you can download ai models to run locally without an internet connection. There are applications for that. In Firefox, HuggingChat uses models from HuggingFace, but I think it is running them on a server and does not download them?

The reason why they are not open source is that we don’t know exactly what data they are trained on. We cannot rebuild them on our own. And for trustworthy, I assume you are talking about the integration and the software using the models, right? At least it is implemented by Mozilla, so there is (to me) some sort of trust involved. Yes, even after all the bullshit I trust Mozilla.

chicken@lemmy.dbzer0.com · 9 months ago

It’s “open weights” if they are publishing the model file but nothing about its creation. There are some hypothetical security concerns with training it to give very specific outputs for certain very specific inputs, but I feel like that’s one of those kind of far-fetched worries, especially if you want to use it for chat or summarization and the comparison is getting AI output from a server API. Local is still way better.

festnt@sh.itjust.works · 9 months ago

probably not

fmstrat@lemmy.nowsci.com · 9 months ago

I mean, if you’re going to do it, where’s the Ollama love?

fruitycoder@sh.itjust.works · 9 months ago

I was disappointed there was no local option…

tetris11@lemmy.ml · 9 months ago

I don’t get it, ollama is a provider no?

/home/pineapplelover@lemm.ee · 9 months ago

I think the point is it’s open source

tetris11@lemmy.ml · 9 months ago

and so is firefox, so why use another model provider

fmstrat@lemmy.nowsci.com · 9 months ago

A provider that can be run locally.

Treczoks@lemmy.world · 9 months ago

Luckily, it seems to be disabled by default. At the moment.