Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code - Ars Technica

hamburgheftig@feddit.org · 3 天前

Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code - Ars Technica

rockerface🇺🇦@lemmy.cafe · 3 天前

the consensus seems to be that adding instructions to code that sabotage other people’s work goes too far

Luckily, the LLM coding isnt people’s work

teft@piefed.social · 3 天前

the consensus seems to be that adding instructions to code that sabotage other people’s work goes too far

I mean, my thought would be “Don’t fucking run code that you don’t understand”.

Smoogs@lemmy.world · 2 天前

it was always a risk in stack overflow so i dont see why suddenly the world needs to exclusively create safe spaces for all the ‘down with safe spaces’ crowd.

frongt@lemmy.zip · 3 天前

If we all followed that rule, we’d be using nothing more complex than an 8080.

RaphaelSchmitz@feddit.org · 2 天前

The code YOU run. If your code runs other code, that doesn’t fall under this.

“Don’t ride a car unless you know how driving a car works” doesn’t mean you need to understand the chemical composition of the metal in the motor parts

Cocodapuf@lemmy.world · 3 天前

Well, I think it’s legit to use software without understanding the code or use hardware without understanding the specifics of the logical mechanisms of the silicon. But when you’re writing software, you really should know what’s in your own code. Anything else is bad form in my opinion.

AwesomeLowlander@sh.itjust.works · 2 天前

It’s an imported library, since when are devs expected to be inspecting the source code of every library they import?

Cocodapuf@lemmy.world · edit-2 2 天前

I don’t like to use libraries I don’t understand. Probably part why I’m not a professional developer, but it’s the principle of the thing - don’t put out code you can’t vouch for.

I mean, yes, it’s way easier to just use the library, trust it works; but by that logic, it’s also way easier to just let an llm code for you.

Amju Wolf@pawb.social · 2 天前

…but do yoz “understand libraries” by reading every line of their code, or by reading the documentation? And only in the parts you’re actually interested in?

Cocodapuf@lemmy.world · 2 天前

Yeah, a general understanding is enough. But I think yeah, actually skim over the code, at least get a basic idea about how the internal methods work. Depending on what you’re using the library for, it could be prudent to know more about how data structures are handled.

Honestly, you’ll probably learn something in the process.

AwesomeLowlander@sh.itjust.works · 2 天前

Probably part why I’m not a professional developer, but it’s the principle of the thing

There’s no ‘principle’ here, that’s something that simply would not be possible in any sort of large project. To suggest all professional software developers read every line of every library before using it is ridiculously unworkable.

Cocodapuf@lemmy.world · 2 天前

deleted by creator

AwesomeLowlander@sh.itjust.works · 2 天前

? Do you have me confused with somebody else?

mabeledo@lemmy.world · edit-2 1 天前

Libraries can be audited. LLM generated code cannot.

Edit: to clarify, it is impossible to audit all LLM generated code across a number of projects, that would replace a single library. It simply won’t happen, because there will always be a non trivial number of users who will copy and paste code without inspecting it. In contrast, widely used open source libraries may be audited by a small subset of their users, and the rest would benefit from that.

Jakeroxs@sh.itjust.works · 2 天前

Yes it can, its literally still code.

mabeledo@lemmy.world · 2 天前

I know it’s code. You are missing the point.

Any library with a critical user mass is auditable, because a fraction of those users would take the time to do so, whereas all LLM generated variations of the same library cannot and will never be auditable.

this@sh.itjust.works · 3 天前

True, but I would think developers should at least be following it with the code they’re actually working on.

AwesomeLowlander@sh.itjust.works · 2 天前

It’s an imported library, since when are devs expected to be inspecting the source code of every library they import?

yessikg@fedia.io · 2 天前

Since forever? Don’t you do security audits on the libraries you use?

AwesomeLowlander@sh.itjust.works · 2 天前

One person from the team, maybe. You don’t have every single dev read every line of code in the libraries, which is what is being specified here

sakuraba@lemmy.ml · 2 天前

it used to be a thing but javascript npm brainrot happened

grue@lemmy.world · 3 天前

Reminds me of https://www.youtube.com/watch?v=OPKGbg16ulU (and also https://www.youtube.com/channel/UCS0N5baNlQWJCUrhCEo8WlA)

Lucidlethargy@sh.itjust.works · 3 天前

I’m a developer, and I support this message.

Fuck all LLM created content. Fuck it all. Burn it all down, my friends.

sunbytes@lemmy.world · 2 天前

So long as the person is using some form of version control, it’s effectively just a slap on the wrist.

Rothe@piefed.social · 3 天前

It’s the stolen work of other people.

Jakeroxs@sh.itjust.works · 2 天前

Like all of human knowledge, I swear you antillm people are out of your mind.

Here we have a way to bring coding and creation to the masses at a much lower bar and most of the LLM projects I see are MIT licensed, it’s literally a revolution for open source but half of you are pearl clutching and acting like god damn Microsoft.

mabeledo@lemmy.world · 2 天前

You are missing the most important questions here: who can afford it, and who owns it.

It’s easy to be pro LLM when $20 a month is not a big deal.

Jakeroxs@sh.itjust.works · 2 天前

Self host an open model, but yeah 20 a month is not that expensive for what you can do with it.

But that’s not what anyone in this thread is saying, they’re saying LLM code bad and stealing so let’s poison open source projects. Also sharing code is bad now, when I’m sure many of these people would claim they like open source code.

Again, I think knowledge and code should be free for all to use so that we all benefit from it.

0xSim@lemdro.id · 2 天前

“self host an open model”. My dude, you need pretty beefy hardware to run a slow and shit model that won’t even compare to the 0.33x models you get with a copilot subscription.

Jakeroxs@sh.itjust.works · 19 小时前

Its getting better all the time, its crazy how much better consumer level hardware can run competent models (even if it’s lower params) these days compared to just 6 months ago.

mabeledo@lemmy.world · 2 天前

I figured you wouldn’t be able to look past your own personal experience. I’m sorry to say that most people outside your bubble cannot afford either the subscription nor the hardware to run usable LLMs locally.

“Sharing code is bad now” because a handful of companies scraped it and not only they haven’t given anything back, they are reselling it in different shapes, and telling people that now all that data is proprietary. So, yes, stolen is an apt word for it.

Anyway, all this talk about “democratizing” knowledge is bullshit. Libraries democratized knowledge. The internet democratized knowledge. Anyone can learn how to code if they put the time and read a book and practice.

But delegated thinking is the opposite of acquiring knowledge, so what the hell are you people yapping about.

Jakeroxs@sh.itjust.works · 19 小时前

You don’t have to delegate thinking, I’m sure many people will but it’s absolutely not a requirement for using LLMs as the intended tool they are.

On the topic of price, I’m sure people were saying the same things about books (oh must be nice you can afford books), then the same about computers and the internet. They eventually became more affordable.

Not even going to touch the “I couldn’t understand economic heardship” aspect.

mabeledo@lemmy.world · 8 小时前

You are betting on massive corporations having a change of heart and putting all their resources at the disposition of the public, for essentially free. Otherwise, AI will never be affordable in the sense that everyone could have free access to models that matter.

And I know that you said that self hosting is a possibility. But let’s be real here: public weight models are available because they pose no risk to the bottom line of the companies training them. There are zero competitive models trained by a non profit. But even if that wasn’t true, the current DRAM shortage is proof that these companies will never allow anyone to match them. Same goes for electricity and water.

Honestly, after all these years of witnessing big tech shitting all over us, I cannot understand where all these hopes come from. Would be endearing if it wasn’t so reckless.

Jakeroxs@sh.itjust.works · 7 小时前

I’m just showing that as technology progresses and scales it generally becomes cheaper and peoples access increases, again were literally on the internet now and have phones in our pockets that can do it, whereas 40 years ago PCs were much more expensive and internet was slow as hell.

We shouldn’t trust big tech, I’m on Lemmy so that should be a bit of a given lol.

Billegh@lemmy.world · 3 天前

I think that’s the problem though, isn’t it. It is other people’s work, condensed down into what could semi-accurately be called a statistics based random word generator. If LLMs were good at it or had people checking behind then that were good we wouldn’t be in this mess in the first place.

rockerface🇺🇦@lemmy.cafe · 3 天前

I meant more the process of generating code via LLM isn’t work. The end result ultimately uses someone else’s work, yes, but the process can be and should be sabotaged.