I appreciate that you didn’t mean to say what you said, but words mean things. I can only respond to what you say, not what you meant.
Especially here, where the difference entirely changes whether you’re right or not.
Because no, “less human code” doesn’t mean “less AI training”. It could mean a slowdown in how fast you can expand the training dataset, but again, old code doesn’t disappear just because you used it for training before. You don’t need a novel training dataset to train. The same data we have plus a little bit of new data is MORE training data, not less.
And less human code is absolutely not the same thing as “new human code will stop being created”. That’s not even a slip of the tongue, those are entirely different concepts.
There is a big difference between arguing that the pace of improvement will slow down (which is probably true even without any data scarcity) and saying that a lack of new human created code will bring AI training to a halt. That is flat out not a thing.
That this leads to “less developments and advancements in programming in general” is also a wild claim. How many brilliant programmers need to get replaced by AI before that’s true? Which fields are generating “developments and advancements in programming”? Are those fields being targeted by AI replacements? More or less than other fields? Does that come from academia or the private sector? Is the pace of development slowing down specifially in that area? Is AI generating “developments and advancements” of its own? Is it doing so faster or slower than human coders? Not at all?
People say a lot of stuff here. Again, on both sides of the aisle. If you know the answers to any of those questions you shouldn’t be arguing on the Internet, you should be investing in tech stock. Try to do something positive with the money after, too.
I’d say it’s more likely you’re just wildly extrapolating from relatively high level observations, though.
Hm. That’s rolling the argument back a few steps there. None of the stuff we’ve talked about in the past few posts has anything to do with the impact of AI-on-AI training.
I mean, you could stretch the idea and argue that there is a filtering problem to be solved or whatever, but that aside everything I’m saying would still be true if AI training exploded any time it’s accidentally given a “Hello world” written by a machine.
But that point is not the same as LLMs degrading when trained on its own data.
Again, it may be the same as the problem of “how do you separate AI generated data from human generated data”, so a filtering issue.
But it’s not the same as the problem of degradation due to self-training. Which I’m fairly sure you’re also misrepresenting, but I REALLY don’t want to get into that.
But hey, if you don’t want to keep talking about this that’s your prerogative. I just want to make it very clear that the reasons why that’s… just not a thing have nothing to do with training on AI-generated data. Your depiction is a wild extrapolation even if you were right about how poisonous AI-generated data is.
deleted by creator
I appreciate that you didn’t mean to say what you said, but words mean things. I can only respond to what you say, not what you meant.
Especially here, where the difference entirely changes whether you’re right or not.
Because no, “less human code” doesn’t mean “less AI training”. It could mean a slowdown in how fast you can expand the training dataset, but again, old code doesn’t disappear just because you used it for training before. You don’t need a novel training dataset to train. The same data we have plus a little bit of new data is MORE training data, not less.
And less human code is absolutely not the same thing as “new human code will stop being created”. That’s not even a slip of the tongue, those are entirely different concepts.
There is a big difference between arguing that the pace of improvement will slow down (which is probably true even without any data scarcity) and saying that a lack of new human created code will bring AI training to a halt. That is flat out not a thing.
That this leads to “less developments and advancements in programming in general” is also a wild claim. How many brilliant programmers need to get replaced by AI before that’s true? Which fields are generating “developments and advancements in programming”? Are those fields being targeted by AI replacements? More or less than other fields? Does that come from academia or the private sector? Is the pace of development slowing down specifially in that area? Is AI generating “developments and advancements” of its own? Is it doing so faster or slower than human coders? Not at all?
People say a lot of stuff here. Again, on both sides of the aisle. If you know the answers to any of those questions you shouldn’t be arguing on the Internet, you should be investing in tech stock. Try to do something positive with the money after, too.
I’d say it’s more likely you’re just wildly extrapolating from relatively high level observations, though.
deleted by creator
Hm. That’s rolling the argument back a few steps there. None of the stuff we’ve talked about in the past few posts has anything to do with the impact of AI-on-AI training.
I mean, you could stretch the idea and argue that there is a filtering problem to be solved or whatever, but that aside everything I’m saying would still be true if AI training exploded any time it’s accidentally given a “Hello world” written by a machine.
deleted by creator
But that point is not the same as LLMs degrading when trained on its own data.
Again, it may be the same as the problem of “how do you separate AI generated data from human generated data”, so a filtering issue.
But it’s not the same as the problem of degradation due to self-training. Which I’m fairly sure you’re also misrepresenting, but I REALLY don’t want to get into that.
But hey, if you don’t want to keep talking about this that’s your prerogative. I just want to make it very clear that the reasons why that’s… just not a thing have nothing to do with training on AI-generated data. Your depiction is a wild extrapolation even if you were right about how poisonous AI-generated data is.