Who owns the code Claude Code wrote?

161

senaevren • about 9 hours ago • 169 comments • Read Article on legallayer.substack.com

Discussion (169 Comments) • Read Original on HackerNews

semiquaver•about 1 hour ago

  > The US Copyright Office confirmed this in January 2025, and the Supreme Court declined to disturb it in March 2026 when it turned away the Thaler appeal. Works predominantly generated by AI without meaningful human authorship are not eligible for copyright protection, and that rule is now settled at the highest judicial level available.

Misstates the law. Denial of certiorari can happen for many reasons unrelated to the merits and does not settle the issue nationwide.

senaevren•about 1 hour ago

Fair and correct. Cert denial means the Court declined to hear the case, not that it endorsed the lower court's reasoning or settled the question nationally. The DC Circuit ruling stands and the Copyright Office's position is consistent, but that is stable doctrine rather than Supreme Court-settled law. Updated the piece to reflect this distinction accurately.

greensoap•about 1 hour ago

Also, I don't think there is any example testing the conclusion. There is no case to point at that any of the factors they listed are sufficient to convey authorship. Would love to be pointed to a case where rejecting decisions and redirecting to a different approach was deemed human authorship. What we do know is that you can disclaim the part of the code a human didn't author. In fact, the Copyright Office requires you disclose and disclaim. If anyone out there has more factual and citable sources please share.

senaevren•30 minutes ago

You are right that no court has yet ruled that a specific set of human contributions to AI-assisted work was sufficient to establish authorship. What exists is the inverse: the Copyright Office has granted partial registrations where human-authored elements were separated from AI-generated elements, as in Zarya of the Dawn, where the human-written text was protected but the Midjourney images were not. The Allen v. Perlmutter case pending in Colorado is the first direct judicial test of whether iterative prompting and editing can constitute authorship. Until that decision, the positive threshold is genuinely unknown. The piece reflects this in the calibration section at the end, though your point is worth adding to the authorship discussion more explicitly.

jmyeet•37 minutes ago

The Supreme Court declining to take up an issue is taking a position.

Now different circuits can take a different view of the same issue. This is a common reason why the Supreme Court will grant cert: to resolve a circuit split. Appeals court judges know this and have at times (allegedly) intentionally split to force an issue to the Supreme Court.

Even without settling the issue appeals courts will look at how other circuits have ruled and be guided by their reasoning, generally. The fact that the Supreme Court declined to grant cert actually carries weight.

semiquaver•3 minutes ago

  > The Supreme Court declining to take up an issue is taking a position.

No it is not.

  > “The denial of a writ of certiorari imports no expression of opinion upon the merits of the case, as the bar has been told many times.”

United States v. Carver, 260 U. S. 482, 490 (1923).

senaevren•28 minutes ago

Fair point and worth being precise about. Cert denial is not meaningless: it leaves the lower court ruling intact, it signals the Court did not find the issue urgent enough to resolve now, and as you note, other circuits will look at the DC Circuit's reasoning. What it does not do is bind other circuits or establish Supreme Court precedent. The distinction matters here because if a Ninth Circuit case involving AI-generated code reaches a different conclusion, that circuit split would be live law regardless of the Thaler cert denial.

greensoap•34 minutes ago

The real issue is that the Thaler case asked a different question: "Can AI be an author?" The lower court said no and SCOTUS left it alone. But the question of "what is enough for the human to be the author" wasn't even part of the case. That is completely unchecked.

senaevren•21 minutes ago

Correct on both counts. Thaler presented the narrowest possible version of the question: zero human involvement, AI listed as sole author, and there was no claim of any human creative contribution. The DC Circuit answered that specific question and only that question. The harder question, what level of human direction is sufficient to establish authorship, was explicitly left open. The court noted it but did not reach it because it did not need to. Allen v. Perlmutter in Colorado is the first case that actually presents that question to a federal court, with 624 prompts, iterative refinement, and Photoshop edits in the fact pattern. The Copyright Office acknowledged the Photoshop edits as human-authored but still denied registration for the AI-generated underlying elements, which means even that case may not give a clean answer on prompting alone. Updated the piece to reflect this distinction. The Zarya of the Dawn partial registration is currently the closest thing to positive guidance that human-authored elements of an AI-assisted work can be separately protected even where the AI-generated elements cannot.

Arcuru•about 3 hours ago

Personally, I think that the human directing the agent owns the copyright for whatever is produced, but the ability for the agent to build it in the first place is based off of stolen IP.

I'm concerned about the copyright 'washing' this enables though, especially in OSS, and I think the right thing for OSS devs to do is to try to publish resulting code with the strongest copyleft licensing that they are comfortable with - https://jackson.dev/post/moral-ai-licensing/

jugg1es•about 8 hours ago

I want this question to have an interesting answer, but everyone knows that if this question ever goes to the courts, ownership will go to the people in charge with the money. The idea that Anthropic may not own Claude Code just because Claude wrote it is wishful thinking.

embedding-shape•about 8 hours ago

Best part is, it's likely to have a different answer in every country, who knows what'll happen, not every country implicitly sides with the ones with the most money.

MarsIronPI•about 2 hours ago

Well, eventually it'll probably be added to the Berne Convention agreement or some such.

adrianN•about 3 hours ago

Depends on where they pay their taxes generally.

senaevren•about 8 hours ago

The work-for-hire doctrine actually supports your intuition more than the AI authorship question does. The reason Anthropic likely owns Claude Code has little to do with whether Claude wrote it and everything to do with the employment contracts of the engineers who directed it. The DMCA takedown question is genuinely interesting though because DMCA requires the claimant to assert copyright ownership in good faith. If a court later found the codebase was predominantly AI-authored and therefore not copyrightable, the 8,000 takedowns could be challenged as bad faith DMCA claims. That is a different and more tractable legal question than the ownership one.

gpm•about 2 hours ago

I have trouble believing that the DMCA claims would be found to be in bad faith when they were made at a time when the question of what degree of human input is required to acquire copyright on AI-generated code hasn't been resolved at all.

It doesn't seem like bad faith to think that copyright is stronger than the courts end up thinking, just being mistaken.

rasz•about 7 hours ago

Work-for-hire doctrine doesn't automagically absolve you from IP law. Microsoft and Intel already learned this in the nineties when they paid San Francisco Canyon Company to steal Apple code.

https://en.wikipedia.org/wiki/San_Francisco_Canyon_Company

LLMs are just code stealers; they will gladly generate Carmack's inverse for you with the original comments.

beej71•about 6 hours ago

I love that genAI art will not be copyrightable and genAI code will be. The power of the Almighty Dollar at work.

conartist6•about 8 hours ago

It's not wishful thinking, and ownership isn't a foregone conclusion.

Sure the courts could mint a communist society with a few weird decisions about property rights, but this being the US do you really suppose that's likely?

There's really no legal question of any kind that models aren't people and therefore cannot own property (and also cannot enter into legal contract as would be required to reassign the intellectual property they don't and can't own)

wongarsu•about 8 hours ago

The catch-22 is that the fact that models aren't people is only relevant if you treat them similar to a person. Like the US Copyright Office's opinion which treats it similar to a freelancer. If you treat the LLM as a machine similar to a camera, with the author expressing their existing intent through the tools of this machine, ownership is back on the table and more or less how it was before LLMs.

conartist6•about 8 hours ago

Well if the camera in addition to choosing autoexposure also decided how to frame the shots, which lens to use, where to stand, and everything else salient to the artistry of photography -- all without direct human intervention, then I would think the situation would again be analogous. If the camera could do all that because an intern was holding it, the intern would still own the shots even if their employer gave them the assignment.

That's why the intern signs an employment contract that reassigns their rights to their employer!!

p0w3n3d•about 8 hours ago

That's quite an impressive approach from the companies' perspective. Let's first use Claude Code and then we'll think about who the code belongs to.

I think that the gold rush approach happening right now around me (my company's EMs forcing me to work with Claude as fast as possible) shows the real short-sightedness of all the management people.

First - I lose my understanding of the code base by relying too much on claude code.

Second - we drop all the good coding practices (like XP, code review etc.) because claude is reviewing claude's code.

Third - we just take a big smelly dump on the teamwork - it's easier and cheaper to let one developer drive the whole change from backend to frontend, despite there are (or were) two different teams - one for FE, one for BE.

Fourth - code commenting was passe, as the code is documentation itself... Unless... there is a problem with the context (which is). So when the people were writing the code, they would not understand the over-engineered code because of their fault. But now we make a step back for our beloved claude because it has small context... It's unfair treatment.

I could go on and on. And all those cultural changes are because of money. So I dub this "goldrush", open my popcorn and see what happens next.

nicoburns•about 8 hours ago

> Third - we just take a big smelly dump on the teamwork - it's easier and cheaper to let one developer drive the whole change from backend to frontend, despite there are (or were) two different teams - one for FE, one for BE.

Agree with your other points, but IMO this one has always been better. You often need to design the backend and frontend to work with each other, and that requires a lot more coordination when it's separate teams.

senaevren•about 7 hours ago

The fourth point about code commenting is the one that connects directly to the ownership question. When developers write comments to explain intent, those comments are evidence of human creative direction. When Claude writes the code and the comments, and the developer merges without adding their own explanation of the architectural decisions, the record of human authorship disappears along with the institutional knowledge. The documentation problem and the copyright problem are the same problem.

sebastianconcpt•about 8 hours ago

Also, it's supremely easy to do the wrong abstractions long term and compromise premature internal designs that will start to starve of human mental modeling, hence explaining with accountability how things work and what the plans are when an incident happens. Also, if the wrong generalizations are introduced, coded correctly and reviewed and approved by AIs, then who's even driving really?

bearjaws•about 8 hours ago

I rarely see #3 yield better solutions, it's usually better to collaborate as a team on requirements and gotchas, but let one person own implementation.

qsera•about 3 hours ago

More interesting question is "Who wants to own it"...

The answer is probably "Nobody"!

nine_k•about 3 hours ago

Depending on the scale. If you ask Claude to one-shot an app from a nebulous description, you get a prototype whose code you would understandably be loath to own. If you plan carefully and limit the scope, you get code that you understand, can approve of, and are okay owning further down the line.

burnte•about 3 hours ago

I spent two and a half hours writing up a detailed outline for a small webapp. Claude popped it out in one shot, 100% working. I added features after, but the time you spend on a good outline saves hours later.

onlyrealcuzzo•about 3 hours ago

Presumably, every company that has non-LGPL CC code in production wants to own it...

nine_k•about 3 hours ago

"Own" as in "be responsible for". Nobody is too keen to own a pile of semi-working trash, and extensive vibe-coding can produce such piles easily.

curt15•about 3 hours ago

Not sure why this is being down voted. Outsourcing work doesn't also outsource accountability.

qsera•about 3 hours ago

Yea, that is how I meant it.

ottah•about 3 hours ago

In my opinion, copyright has mattered very little in the corporate world. Copyright is effectively meaningless with SaaS, and compiled software run on your machine is protected more by technical controls and EULAs. A world where copyright didn't exist for software would look nearly the same for the commercial world. Trade secrets, NDAs, and employment contracts bind workers more than copyright. The only place where the question of copyright has real-world impact is open source, and even then only for more restrictive licenses such as the GPL.

_flux•about 8 hours ago

I think it should be pretty clear that if you provided the tool the specification for the code you want, you have already provided creative input.

After all, is this not what happens with compilers as well? LLM agents are just quite advanced compilers that don't require the specification to be as detailed as with traditional compilers.

senaevren•about 8 hours ago

The compiler analogy is the right one to reach for and the Copyright Office addressed it directly: the question is not whether you provided input, it is whether the creative expression in the output reflects human authorship. With a traditional compiler, the programmer authors every expression in the source. With an LLM, the programmer authors the intent and the model makes the expressive decisions about structure, naming, pattern, and implementation. Whether that distinction matters legally is what Allen v. Perlmutter is working through right now. The summary judgment briefing completed in early 2026 and it may be the next landmark ruling on exactly this question.

yodon•about 8 hours ago

>it should be pretty clear that if you provided the tool the specification for the code you want, you have already provided creative input.

If you provided a human contractor with the specifications for the code you want, the courts have repeatedly made clear you have not provided the creative input from a copyright perspective, and the contractor needs to explicitly assign those rights to you if you want to own the copyright on the code.

_flux•about 5 hours ago

Let's say we didn't have assemblers, but instead we would have three professions:

- Specifiers, who make the specification for the system

- Programmers, who write C code

- Machine encoders, that take that C code and write machine code for a CPU

Would it be that the copyright would then belong to programmers, if no other explicit assignments would be made?

---

Thinking about it, probably yes: copyright of the spec belongs to specifiers, copyright of the C belongs to programmers, and copyright of machine code to machine encoders. Or would it depend on the amount of optimizations the machine encoders would do, i.e. is it creative or not? And then does this relate to the task and copyrightability of C compiler output, where optimizations can sometimes surprise the developer?

hypercube33•about 8 hours ago

To me this is like asking who owns the binary files a compiler generates.

bko•about 8 hours ago

This is all well and good as an intellectual exercise, but in real life none of this matters. Almost no one thinks their code is copyrightable or seriously thinks their code is a moat. I've written the same chunks of code for a number of employers as has every engineer. We've all taken chunks from stack overflow and other places without carefully considering attribution.

This comes up in a few places as a kind of vindictive battle. One example is Oracle suing Google for too closely mimicking their API in Android. Here is an example:

    private static void rangeCheck(int arrayLen, int fromIndex, int toIndex) {
        if (fromIndex > toIndex)
            throw new IllegalArgumentException("fromIndex(" + fromIndex +
                                               ") > toIndex(" + toIndex + ")");
        if (fromIndex < 0)
            throw new ArrayIndexOutOfBoundsException(fromIndex);
        if (toIndex > arrayLen)
            throw new ArrayIndexOutOfBoundsException(toIndex);
    }

And it was deemed fair use by the Supreme Court. Other times high frequency hedge funds sued exiting employees, sometimes successfully. In America, anyone can sue you for any reason, so sure, you'll have Ellison take a feud up with Page and Brin all the way up to the Supreme Court.

In 99.9% of instances none of this matters. Sure there's the technical letter of the law but in practice, and especially now, none of this matters.

https://www.supremecourt.gov/opinions/20pdf/18-956_d18f.pdf
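For readers who have not seen the method run, here is a minimal sketch of how the quoted lines behave. The wrapper class `RangeCheckDemo` and the helper `outcome` are hypothetical names added purely for illustration; only the body of `rangeCheck` comes from the snippet in the comment above.

```java
public class RangeCheckDemo {
    // Body reproduced from the snippet quoted in the comment above.
    private static void rangeCheck(int arrayLen, int fromIndex, int toIndex) {
        if (fromIndex > toIndex)
            throw new IllegalArgumentException("fromIndex(" + fromIndex +
                                               ") > toIndex(" + toIndex + ")");
        if (fromIndex < 0)
            throw new ArrayIndexOutOfBoundsException(fromIndex);
        if (toIndex > arrayLen)
            throw new ArrayIndexOutOfBoundsException(toIndex);
    }

    // Hypothetical helper: reports which check (if any) rejected the range.
    public static String outcome(int arrayLen, int fromIndex, int toIndex) {
        try {
            rangeCheck(arrayLen, fromIndex, toIndex);
            return "ok";
        } catch (IllegalArgumentException e) {
            return "IllegalArgumentException";
        } catch (ArrayIndexOutOfBoundsException e) {
            return "ArrayIndexOutOfBoundsException";
        }
    }

    public static void main(String[] args) {
        System.out.println(outcome(10, 2, 5));   // valid range
        System.out.println(outcome(10, 5, 2));   // fromIndex > toIndex
        System.out.println(outcome(10, -1, 5));  // negative fromIndex
        System.out.println(outcome(10, 2, 11));  // toIndex past the end
    }
}
```

The method is three bounds checks in a fixed order, which is the kind of code the thread describes as largely dictated by what it has to do.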

freedomben•about 8 hours ago

> Almost no one thinks their code is copyrightable or seriously thinks their code is a moat.

You'd be surprised! Among non-software management types, they often think of the code as extremely valuable IP and a trade secret. I'm a CTO and I've made comments before to non/less technical peers about how the code (generally speaking) isn't that big of a secret, and I routinely get shocked expressions. In one case the company almost passed on a big contract because it required disclosure of the source code (with an NDA). When I told them that was a silly reason and explained why, they got it, but the old way of thinking still permeates and is a hard habit to break.

Edit: Fixed errant copy pasta error. Glad that wasn't a password :-)

bko•about 8 hours ago

You're right, I guess maybe I mean in any serious actionable way. Senior, non technical people leave plenty of money on the table by thinking they're protecting something valuable or they have some kind of secret sauce. It's all silly is what I meant to say, and digging into the technicalities of whether your code is truly copyrightable is kind of pointless. It's all vibes.

senaevren•about 7 hours ago

The place where it concretely matters is M&A due diligence. Acquirers are now routinely asking about AI tool usage in development and running license scans as a condition of closing. A codebase that cannot demonstrate human authorship over its core IP, or that contains GPL contamination, creates a representation and warranty problem in the purchase agreement. For most companies day to day you are right. For the companies that get acquired or raise institutional capital, the question becomes very concrete very quickly.
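To make "license scan" concrete, here is a toy sketch of the simplest form such a check could take: looking for `SPDX-License-Identifier` header tags in source text and flagging copyleft identifiers. The class and method names are hypothetical, the copyleft test is a crude prefix match, and real due-diligence tooling does far more (snippet matching, dependency analysis). In particular, a header scan like this cannot detect code that was reproduced without its original header, which is the scenario the thread is worried about.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LicenseScanSketch {
    // SPDX short-form tag, e.g. "// SPDX-License-Identifier: GPL-3.0-only".
    private static final Pattern SPDX =
        Pattern.compile("SPDX-License-Identifier:\\s*([A-Za-z0-9.+-]+)");

    // Returns the first SPDX identifier declared in the source text, or null.
    public static String declaredLicense(String source) {
        Matcher m = SPDX.matcher(source);
        return m.find() ? m.group(1) : null;
    }

    // Crude assumption: treat GPL-*, LGPL-* and AGPL-* identifiers as copyleft.
    public static boolean isCopyleft(String spdxId) {
        return spdxId != null
            && (spdxId.startsWith("GPL-")
                || spdxId.startsWith("LGPL-")
                || spdxId.startsWith("AGPL-"));
    }

    // Returns the names of files whose declared license is flagged as copyleft.
    public static List<String> flagCopyleft(Map<String, String> sourceByFileName) {
        List<String> flagged = new ArrayList<>();
        for (Map.Entry<String, String> e : sourceByFileName.entrySet()) {
            if (isCopyleft(declaredLicense(e.getValue()))) {
                flagged.add(e.getKey());
            }
        }
        return flagged;
    }

    public static void main(String[] args) {
        Map<String, String> files = new LinkedHashMap<>();
        files.put("Util.java", "// SPDX-License-Identifier: MIT\nclass Util {}");
        files.put("Parser.java", "// SPDX-License-Identifier: GPL-3.0-only\nclass Parser {}");
        files.put("Gen.java", "class Gen {}"); // no header: the scan learns nothing
        System.out.println(flagCopyleft(files));
    }
}
```

A file with no header passes this check silently, so a clean result here says nothing about provenance.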

hackingonempty•about 7 hours ago

Maybe LLM coding agents change the equation by making it much easier to adapt and use foreign and probably incomplete code. Getting you closer to competing with the original authors in a shorter amount of time than generating new code from scratch.

conartist6•about 8 hours ago

Nobody ever talks about convergence.

You, right now, are talking about convergence.

If there is no artwork, there can be no copyright. If every character of the code to write is basically predetermined by the APIs you need to call, there is no artwork and no copyright.

Build a novel new API, and you'll be protected though.

sarchertech•about 7 hours ago

> Almost no one thinks their code is copyrightable

Every open source license is built on the premise that code is copyrightable.

adrian_b•about 3 hours ago

No.

It is based on the premise that if the proprietary licenses are valid, then also the open source licenses are valid.

So what is held as true is only the implication stated above and not the truth value of the claims that either kind of licenses are valid.

If the proprietary licenses are not valid, then it does not matter that also the open source licenses are not valid.

The open source licenses are intended as defenses against the people who would otherwise attempt to claim ownership of that code and apply a proprietary license to the code, i.e. exactly what now Anthropic and the like have done, together with their corporate customers.

Of course, if it is accepted that the code generated by an AI coding assistant is not copyrightable, then using it would not really be a violation of the original open source licenses. The problem is that even if this principle is the one accepted legally, at least for now, both Anthropic and their corporate customers appear to assume that they own the copyright for this code that should have been either non-copyrightable or governed by the original licenses of the code used for training.

Rietty•about 8 hours ago

Why were the HFT firms suing employees?

Nursie•about 8 hours ago

> Almost no one thinks their code is copyrightable

I think this is an unusual opinion.

Code may not be copyrightable in as small chunks as you put there, but in terms of larger pieces I think companies and individuals very often labour under the belief that code is intellectual property under copyright law.

If code isn't copyrightable, from where comes the GPL?

And why does anyone care if (for instance) some Microsoft code might have accidentally ended up in ReactOS, causing that project to need to go into a locked-down review mode for months or years? For that matter why do employers assert that they own the copyright in contracts?

I think it's the opposite - almost everyone thinks their code is copyrightable, outside of APIs and interop stuff, or things so simple as to be trivial.

croes•about 8 hours ago

> Almost no one thinks their code is copyrightable

Then why does reverse engineered code need to be a clean room implementation?

Ask any emulator developer or the developers of ReactOS

https://reactos.org/forum/viewtopic.php?t=21740

hmokiguess•about 2 hours ago

Tangential but I find this an interesting parallel from a few years ago:

https://www.vice.com/en/article/musicians-algorithmically-ge...

jhbadger•about 9 hours ago

This is of course assuming you take AI-generated code unchanged. But you don't, in my experience. And that generates a new work fully copyrightable even if the original wasn't. Just like how the fad a decade or so ago of taking Tolstoy and Jane Austen works and adding new elements -- "Android Karenina" and "Sense and Sensibility and Sea Monsters" are copyrighted works even if the majority of the text in them was from public domain sources.

FartyMcFarter•about 8 hours ago

The article addresses this explicitly:

> Works predominantly generated by AI without meaningful human authorship are not eligible for copyright protection

Note the word "predominantly", and the discussion that follows in the article about what the courts and the copyright office said.

wongarsu•about 8 hours ago

Skimming over the article, it's a lot about what the copyright office said and very little about what courts said. But the opinion of the copyright office doesn't have any legal force. Regulations passed by the copyright office would be binding, but their opinions are just opinions. We will have to wait until relevant court cases reach a conclusion. And so far running litigation isn't even about that question; it's about infringing the rights of works that are in the training data.

Luker88•about 8 hours ago

No such assumption is made in the article.

Nor does it give a single answer.

Mere prompting is still not enough for copyright, and the problem is unsolved on how much contribution a human needs to make to the generated code.

In the case for generated images copyright has been assigned only to the human-modified parts.

Even worse, it will be slightly different in other nations.

The only one that accepts copyright for the unchanged output of a prompt is China.

ModernMech•about 8 hours ago

Here's a question I have: if the AI generated image is of a character of which you own the IP, don't you have protections based on the character regardless of who gets copyright protections from authorship of the image?

sarchertech•about 7 hours ago

Yeah if you have a copyright on the character, the AI generated image doesn’t change that. It doesn’t give you more or less protection than you already had.

conartist6•about 8 hours ago

I'm sure it's not quite that simple. Only the parts of those knock-off works that aren't public domain could be copyrightable. If you only own the copyright to ten lines in a 10k line codebase, then it's probably fair use for someone else to just take the whole thing.

Plus what if Anna Karenina was GPL?

brianwawok•about 8 hours ago

You use humans to edit AI code? When you level up you are just using AI to write, AI to review, AI to edit, AI to test. Not a lot of steps left for meat bags.

mathgeek•about 8 hours ago

You're forgetting that you need coffee/tea/mate to fuel the button pushers. The Jetsons predicted this decades ago.

gchamonlive•about 8 hours ago

AI for review is terrible, and by no fault of their own. It's our job to specify and document intention, domain and the right problems to solve, and that is just hard to do. No getting around it. That's job security for us meat bags.

ModernMech•about 8 hours ago

AI to write - code is buggy and not what I asked for

AI to review - shallow minutia and bikeshedding

AI to edit - wrote duplicated functions that already existed

AI to test - special casing and disabling code to pass the narrow tests it wrote

AI report - "Everything looks good, ship it!"

throwatdem12311•about 8 hours ago

Ok what about all the Anthropic engineers who say they don’t write code at all and it’s 100% AI-generated?

gchamonlive•about 8 hours ago

> This is of course assuming you take AI-generated code unchanged.

How much code do you need to change in order for it to be original? One line? 10%? More than 50%?

That's arbitrary and quite unproductive convo to be honest.

ninkendo•about 8 hours ago

> That's arbitrary and quite unproductive convo to be honest.

Yeah but that’s what the legal system ostensibly does. Splitting fine hairs over whether a derived work is “transformative” is something lawyers and judges have been arguing and deciding for centuries. Just because it’s hard to define a bright red line, doesn’t mean the decision is arbitrary. Courts will mull over whether a dotted quarter note on the fourth bar of a melody constitutes an independent work all day long. It seems absurd, but deciding blurry lines is what courts are built to handle.

gchamonlive•about 6 hours ago

EDIT: I changed my argument completely.

That makes no sense because what if you refactor your code ad infinitum using AI? You spin up a working implementation, then read through the code, catalog the changes like interface, docs, code quality and patterns and delegate to the AI to write what you would.

It's 100% AI code and it's 100% human code. That distinction is what's counterproductive.

stvltvs•about 7 hours ago

Because at the end of the day, someone has to own the code, so some lines have to be drawn no matter how arbitrary they seem.

exe34•about 3 hours ago

> This is of course assuming you take AI-generated code unchanged. But you don't, in my experience. And that generates a new work fully copyrightable even if the original wasn't.

That's not how copyright works. The modified version is derivative. You can't just take the Linux kernel, make some changes, and slap a new license on it.

mzl•about 6 hours ago

If you modify the work, that creates a derived work from whatever copyright the original work has, not a new work that is fully copyrightable.

As the article says in the TL;DR at the top, the code may be contaminated by open source licenses:

> Agentic coding tools like Claude Code, Cursor, and Codex generate code that may be uncopyrightable, owned by your employer, or contaminated by open source licenses you cannot see

6stringmerc•about 7 hours ago

Wrong. This territory was heavily covered in music before this code concept - it has to be “transformative” in the eyes of the law. Even going in and cleaning up code or adding 10-25% new code won’t pass this threshold. Don't bother arguing with me on this, just accept reality and deal with it.

jhbadger•about 5 hours ago

My copy of "Sense and Sensibility and Sea Monsters" is explicitly listed as being copyrighted by Ben H. Winters in 2009 despite the majority of the words being Austen's, though. Perhaps music has different rules compared to text. I suspect Winters and his publisher have investigated the legality of this more than either of us have.

acdha•about 3 hours ago

Jane Austen died long enough ago that her works are in the public domain, so Winters did not need a license to use them. That does not mean that he gained rights to her work: if he tried to sue someone for use of anything which appeared in the original, he would lose in court because it’s easy to show that copies made before he was born had the same text. This is also how they prevent people trying to extend copyright by making minor changes to an existing work: the new copyright only covers the additions.

There’s a very accessible summary of the United States rules here:

https://www.copyright.gov/circs/circ14.pdf

metalcrow•about 3 hours ago

"if Claude was trained on the LGPL-licensed codebase and its output reflects patterns learned from that code, can the output be treated as license-free? The emerging legal consensus is probably not, and assuming it can creates significant liability for anyone shipping that code commercially."

Is there any citation for this "legal consensus"? I was not aware there were any evidence-backed stances on this topic as of yet.

onlyrealcuzzo•about 3 hours ago

This sounds like a problem that's pretty easy to get around.

CC does not need LGPL code. There's more than enough BSD and Apache code to go around.

And they can generate synthetic data that is better than LGPL for their training.

It's also a problem that does not seem feasible to meaningfully enforce.

It's easy to generate CC code and lie and say you didn't. It would be hard to prove that you did, especially if you took any precautions to make it even slightly difficult to show that you did.

adrian_b•about 3 hours ago

Unlike GPL, BSD and Apache licenses do not claim to also cover your non-AI-generated code that only invokes the AI-generated code.

However, even if the BSD/Apache/MIT licensed code can be incorporated freely in your application, you still have no right to remove the copyright notices from it and/or to claim that you own the copyright for it.

Therefore, unless the AI model has been trained only on non-copyrighted public-domain code, incorporating the generated code in your application means that you have removed the copyright notices from it, which is not allowed by the original licenses.

There is absolutely no doubt that using an AI coding assistant works around the copyright laws, but it is still equivalent to copying and pasting fragments from copyrighted works into your source code.

I consider that copyright should not be applicable to program sources, at least not in its current form, so reusing parts from other programs should be fair use, but only if human programmers would be allowed to do the same.

zuzululu•about 3 hours ago

I think it's pretty clear cut, whoever is paying for your agentic coding tool subscription is part of the litmus test.

If I use my own computer, pay for my own subscription, and build my open source projects, then the code belongs to me.

If I use my company's computer, they pay for my subscription, and we work on the company's projects, then the code belongs to the company.

In any step of the way if some copy-left or any other form of exotic open source license is violated, who pays for discovery? Is it someone in Russia who created a popular OSS library that is now owed? How will it be enforced?

briandw•about 1 hour ago

Your employer can claim your code if you use their tools to produce it. Nothing new here. This has nothing to do with AI tooling.

mlmonkey•about 2 hours ago

On a related note, another question: who owns the paper that Claude (or OpenAI) wrote? Should such paper submissions in conferences call out the model(s) used to write the paper itself?

daishi55•about 8 hours ago

I’m no lawyer but I feel that Meta, my employer, wouldn’t be letting us go hog-wild with Claude Code if they weren’t completely confident that they fully owned the outputs, whether we change them or not.

senaevren•about 7 hours ago

Meta's confidence almost certainly rests on the employment contracts and IP assignment clauses, not on a legal theory that AI output is inherently copyrightable. The enterprise agreement with Anthropic assigns outputs to the licensee. The employment contract assigns work product to Meta. Those two documents together give Meta a defensible ownership position regardless of the authorship question. The interesting gap is for developers using personal accounts or consumer plans on side projects, where neither of those documents exists.

beej71•about 6 hours ago

I don't understand how a company can hold copyright in code that is inherently uncopyrightable (in the unlikely event SCOTUS rules that way).

sarchertech•about 7 hours ago

There’s so much FOMO right now around AI that no one is thinking clearly. I wouldn’t be so confident in your company.

user34283•about 2 hours ago

To evaluate the legal risks of using AI generated code, let’s consider how many lawsuits there have been over these concerns.

Inadvertent copyleft license violations: probably 0 lawsuits

Competitor copied your software, you could not defend your rights in court because it was made with AI: probably also 0

Users of agentic AI for software development: >10 million

The thinking here seems pretty clear to me.

pfortuny•about 2 hours ago

You don't, but you nevertheless bear the responsibility of making it public (whether in source or binary form). That is what Anthropic would like.

gspr•about 1 hour ago

I'm still flabbergasted that people – and big, visible companies with big targets on their backs – choose to keep on using the output of LLMs without having an answer to these questions.

And I'm worried that once that has been sufficiently normalized, laws and interpretations of them will adapt to whatever best suits those users. Which will mean copyrightwashing of FOSS. My only hope then is that surely if free software can be copyright-washed by the big guys, then so can the little guy copyright-wash the big guys' blockbuster movies or whatever, which might lead to some sort of reckoning.

Isamu•about 1 hour ago

Copyright has a lot to do with what we as a society want to protect and encourage. We want to protect an author that put the hours into creating a book, as opposed to the person creating a copy of that work. The person copying can claim they put in work too but the claim is not strong enough to override our preference to protect original authors.

Part of the problem with generated works is that it is lower effort like the person copying something. It’s not an activity that demands special protection like original authorship. I believe this is a large part of the reasoning.

torben-friis•about 1 hour ago

AI is a monster to our current copyright system - monster in the philosophical sense, that is, an example that destroys the concept.

First, its creation is (claimed to be) extremely useful for society, but in order to be created it requires ignoring copyright for pretty much everything ever written. Something we kinda shrugged under the table.

Then, it introduces an extreme jump down in creation effort - so if the focus is protection of effortful creation, nothing with AI use qualifies. But of course, you'd want society to benefit from effortlessness in general, spending more effort than needed in a task is the opposite of efficiency.

6d6b73•about 1 hour ago

LLMs are just tools we use. If I program an app in C++, do I not own the rights to the executable because my compiler wrote machine code for me?

TheFirstNubian•about 7 hours ago

The elephant in the room, of course, is what constitutes “meaningful human authorship.” However, I cannot shake off the feeling that all user interactions with these AI models are being logged. Perhaps this may turn out to be the bigger concern in a potential legal battle than code authorship.

palata•about 8 hours ago

One question I have is this: if an employee produces code predominantly generated by AI, it means that it is not copyrightable. Does that mean that the employee can take that code and publish it on the Internet?

Or is it still IP even if it is not copyrightable? That would feel weird: if it's in the public domain, then it's not IP, is it?

senaevren•about 7 hours ago

That is exactly the right question and the answer is genuinely strange. Uncopyrightable work falls into the public domain, which means anyone can use it, copy it, or build on it freely. The employer can still call it a trade secret and protect it through confidentiality obligations in employment contracts, but that protection is contractual rather than property-based. A trade secret loses protection the moment it is disclosed. So the employer's claim over purely AI-generated code is essentially: "you cannot share this" rather than "we own this." Those are meaningfully different legal positions, and most companies have not thought through which one they actually have.

zvr•about 6 hours ago

Yes, and if the same code ends up in someone else's hands, they can state "we didn't steal it, a GenAI generated it for us, the same as it did for you". Given the non-deterministic operation of current GenAI systems (a major difference from compilers), it would probably be hard to prove either position.

palata•about 7 hours ago

So employees are not allowed to distribute the code, but if it leaks, then it is public and the company cannot do anything about it. Correct? That's what happened to Anthropic I think?

BlackFly•about 8 hours ago

A recipe isn't copyrightable but is still protected under trade secret law. I imagine that the same would apply. I think the major difference with software copyright is that I can just decompile your binary or copy a binary and give it to other people. For SAAS companies that don't distribute binaries, I imagine they basically have the same protections against rogue employees.

cillian64•about 8 hours ago

To look at it another way, just because some code I work on at my job is derived from open source MIT-licensed code doesn't mean I personally have the right to distribute it if my company doesn't want me to. I'd guess this comes under some generic "confidential information" clause in the employment contract.

palata•about 7 hours ago

Hmm your example is different: if you manually write code, there is a copyright for it whether it is derived from an MIT-licence or not. If you don't own that copyright (because your employer does), then you don't have the right to distribute it because it is not your code.

If you generate the same code with AI, now it does not have a copyright. If it depends on an MIT library, then the MIT library has a copyright and you have to honour the licence. But the code you produced does not have a copyright (because it was generated by an AI). And therefore nobody "owns" it. My question is: can your employer prevent you from distributing something they don't own?

ModernMech•about 8 hours ago

Presumably company policy would be implicated here, not copyright law. Whether or not it's copyrightable, what you create using AI is work product.

joshka•about 7 hours ago

If you want to go much deeper, https://www.copyright.gov/ai/ is particularly good at least on the side of comprehensiveness.

hackingonempty•about 7 hours ago

Nobody disputes that I own the copyright in a sound recording I made just by pushing the red button on my recorder. So it is a mystery to me that copyright to any sort of human conditioned machine generation is in dispute.

senaevren•about 7 hours ago

The sound recording analogy breaks down at the point where the recorder makes no creative decisions. Pressing record captures what is already there. Prompting Claude generates something that did not exist, through decisions the model makes about structure, naming, pattern, and implementation. The closer analogy is hiring a session musician and telling them the key and tempo. You own the recording under work-for-hire if they signed the right contract, but the creative expression in the performance is theirs unless explicitly assigned. The button you push to start the model is not the same button as the one on the recorder.

CamperBob2•about 3 hours ago

Fourier theory says that any sound, however complex, can be synthesized by summing sines and cosines. That's what an LLM does, if you twist the metaphor enough. It synthesizes complex outputs from simpler basis functions that are, or should be, uncopyrightable.

The fact that it inferred those basis functions from studying copyrighted works doesn't seem relevant. Nor does the fact that the "Fourier sums" sometimes coincide with larger fragments of works that are copyrighted. How weird would it be if that didn't happen?
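To make the Fourier half of that metaphor concrete, here is a minimal sketch that builds a square wave out of nothing but sines. The function name is made up for illustration, and it only demonstrates the signal-processing claim; it says nothing about how an LLM actually works internally.

```python
import math


def square_wave_partial_sum(t: float, n_terms: int) -> float:
    """Approximate a unit square wave (period 1) at time t.

    Uses the first n_terms odd sine harmonics of its Fourier series:
    (4 / pi) * sum over odd h of sin(2 * pi * h * t) / h.
    """
    total = 0.0
    for k in range(n_terms):
        harmonic = 2 * k + 1  # only odd harmonics contribute
        total += math.sin(2 * math.pi * harmonic * t) / harmonic
    return 4 / math.pi * total


# A single sine is a poor square wave; many of them get close to the flat top.
rough = square_wave_partial_sum(0.25, 1)
smooth = square_wave_partial_sum(0.25, 1000)
```

With more terms the sum approaches +1 on the first half of the period and -1 on the second, even though no individual sine looks anything like a square wave.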

e12e•about 8 hours ago

Seems to gloss over other kinds of contamination beyond GPL code: code from pirated textbooks, the problem of the entire language model being trained on copyrighted data, and the possibility of the training data containing various copyrighted code.

embedding-shape•about 8 hours ago

> Code from pirated text books

Anthropic "solved" this by intermingling the texts extracted from pirated books (illegal) with texts extracted from the physical books they bought and destroyed (legal), so no one can clearly say if the copyrighted material it spits out came from a legal source or not. Everyone rejoiced.

senaevren•about 7 hours ago

The intermingling argument is actually central to the Bartz settlement structure. The settlement required destruction of the pirated dataset specifically because commingled training data creates an unresolvable provenance problem. For deployers building on Claude, EDPB Opinion 28/2024 requires a documented assessment of the foundation model's training data legal basis before deployment. "We cannot tell which outputs came from which source" is not a satisfactory answer to a regulator running that assessment. I wrote about it before here: https://legallayer.substack.com/p/i-read-every-edpb-document...

e12e•about 4 hours ago

> books they bought and destroyed (legal)

They're only legal if training is fair use, and even then I don't think it's immediately clear what the legal status would be of verbatim regurgitation of copyrighted code, or of code protected by patents.

AFAIK I (as a human developer) can't assume that I can go and copy code out of a textbook, then claim copyright and charge for a license to it?

embedding-shape•about 4 hours ago

> They're only legal if training is fair use

The judge seems to have said it was the fact that they "transformed" the books (destroying them after digitizing) in the process that made it legal.

> Ultimately, Judge William Alsup ruled that this destructive scanning operation qualified as fair use—but only because Anthropic had legally purchased the books first, destroyed each print copy after scanning, and kept the digital files internally rather than distributing them. The judge compared the process to “conserv[ing] space” through format conversion and found it transformative. - https://arstechnica.com/ai/2025/06/anthropic-destroyed-milli...

bearjaws•about 8 hours ago

Article is incredibly fear mongering.

Twice in my career the owners of a company have wanted to sue competitors for stealing their "product" after poaching our staff.

Each time, the lawyers came in and basically told us that suing them for copyright is suicide, will inevitably be nearly impossible to prove, and money would be better spent in many other areas.

In fact, we ended up suing them (and they settled) for stealing our copyrighted clinical content, which they copied so blatantly they left our own typos and customer support phone number in it.

Go ahead, try to sue over your copyrighted code; 10 years and 100M later you will end up like Google v Oracle. What if the code is even 5% different? What about elements dictated by external constraints: hardware, industry standards, common programming practices? These aren't copyrightable.

Then you have merger doctrine, how many ways can we really represent the same basic functions?

Same goes with the copyleft argument, "code resembling copyleft" is incredibly vague, it would need to be verbatim the code, not resembling. Then you have the history of copyleft, there have been many abuses of copyleft and only ~10 notable lawsuits. Now because AI wrote it (which makes it _even harder_ to enforce), we will see a sudden outburst of copyleft cases? I doubt it.

Ultimately anyone can sue you for any reason, nothing is stopping anyone right now from suing you claiming AI stole their copyleft code.

tommy29tmar•about 8 hours ago

Maybe the useful test is not “who wrote this line?” but “can you show how it went from requirement/prompt/context to diff to human review/tests?” If you can’t, ownership is only one issue. You also can’t tell what was accepted as engineering work versus just copied output.

senaevren•about 7 hours ago

This is actually closer to how the Copyright Office thinks about it than the article makes clear. The registration guidance that emerged from the Thaler proceedings specifically asks applicants to describe the human creative contributions and how the AI was used. A documented workflow showing requirement, architectural decision, rejection of AI output, human restructuring, and review creates a paper trail that maps directly onto what the Office looks for. The "can you show how it got here" test you are describing is the practical version of the legal standard.
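As a rough sketch of what such a paper trail could look like in practice, here is one way to log each AI-assisted change. Every name and field below is hypothetical, chosen to mirror the steps listed above; none of it comes from a Copyright Office form or any real tool, and passing the check at the end is not a legal test of authorship.

```python
import json
from dataclasses import dataclass, field, asdict
from datetime import datetime, timezone
from typing import List


@dataclass
class ProvenanceRecord:
    """One entry in a hypothetical per-change provenance log."""

    requirement: str            # what the change was meant to achieve
    prompt: str                 # what the human asked the AI tool for
    ai_output_rejected: bool    # did the human reject or redirect a suggestion?
    human_changes: List[str] = field(default_factory=list)  # edits made by hand
    reviewer: str = ""          # who reviewed the final diff
    tests_passed: bool = False
    recorded_at: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

    def to_json_line(self) -> str:
        """Serialize the record as one JSON line for an append-only log."""
        return json.dumps(asdict(self), sort_keys=True)

    def documents_human_contribution(self) -> bool:
        """True if the record names both hand-made changes and a reviewer.

        This is a bookkeeping check only, not a legal standard.
        """
        return bool(self.human_changes) and bool(self.reviewer)


record = ProvenanceRecord(
    requirement="Retry failed uploads with backoff",
    prompt="Add retry logic to the upload client",
    ai_output_rejected=True,
    human_changes=["Replaced recursive retry with a bounded loop"],
    reviewer="alice",
    tests_passed=True,
)
log_line = record.to_json_line()
```

Appending one such line per merged change, next to the commit hash, would give a chronological record of who decided what, which is the kind of evidence the "show how it got here" test asks for.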

skadge•about 8 hours ago

This seems to be grounded in US law. Does anyone know if the same rules would apply in eg EU law?

zvr•about 6 hours ago

Most of this is based on the legal framework of copyright, which is surprisingly homogeneous around the world. The discussions about ownership of AI-generated material are exactly the same in the EU.

threepts•about 3 hours ago

Whoever pays for the tokens.

jMyles•about 2 hours ago

There is no such thing as ownership of a pattern of information. It has been an illusion, and that illusion is now fading.

padmabushan•about 8 hours ago

First answer who owns the model built with public data

senaevren•about 7 hours ago

The model ownership question and the output ownership question run on separate legal tracks and the piece focuses on the second deliberately. On the first: the model weights are owned by Anthropic under work-for-hire from their engineers regardless of what the training data contained. Training data copyright infringement is a separate tort claim against Anthropic, not a basis for anyone else to claim ownership of the model. The Bartz settlement resolved the pirated books claim without disturbing Anthropic's ownership of the weights. Owning the training data does not give you ownership of the model trained on it, any more than owning the paint gives you ownership of the painting.

kouru225•about 3 hours ago

IMO this is the greatest argument against AI as technofascism. The general public seems to believe that AI will usher in technofascism by claiming corporate ownership of AI output: the independent entrepreneur will be unable to compete against the corporations' compute, every piece of data about you will be stolen and monetized by AI, and you will own nothing.

But AI might in fact do the exact opposite and reverse the privatization trend that the West has been going through for the last 400 years. All of our copyright laws rely on the idea that there is a human consciousness behind the copyright. The more AI has input, the less we can claim ownership. If AI returns everything to the commons, then it results in a much more egalitarian world.

Hilariously, many people, especially artists, see the return of the commons as an assault against them. They’re so captured by copyright that they assume any infringement on their copyright is inherently fascist. It’s ridiculous. Copyright is a corporation's number one weapon when it comes to creating a moat and keeping the masses out.

The original intent of copyright, in fact, was an incentive to return an idea to the commons. Experts used to hide their discoveries in order to keep them for themselves. Copyright provided an opportunity to release this knowledge and still profit. There were even several cases where it was established that those who claimed copyright could retain copyright even if the idea had been previously discovered. This created a huge incentive: release the knowledge or risk having your process copyrighted by the opposition. But that system worked because copyright could only exist for so long (14 years, doubled if they filed again.)

Now copyright is a lifelong sentence at almost 100 years. The entire purpose of it has been undermined. Corporations own all your childhood and by the time you can profit off of it, it’s outdated.

A world where the mainstream is primarily a commons seems to me like an egalitarian world. I’d like to live in that world.

smashed•about 8 hours ago

The "if you generated the code at work using company tools, it's owned by your employer" affirmation in the article makes no sense to me?

If computer generated code is not copyrightable, ownership cannot be reassigned either.

conartist6•about 8 hours ago

It is copyrightable. A *human* can copyright code they wrote.

smashed•about 8 hours ago

I meant in the sense that the "tool" is an LLM and the "work" was vibe coded.

If vibe coded work is not copyrightable, it cannot be reassigned to the employer and become copyright protected.

senaevren•about 8 hours ago

This is the sharpest point in the thread. You are right: if the output has no copyright to begin with, there is nothing to assign. The employer's contractual claim over purely AI-generated code is not a copyright claim; it is a trade secret and confidentiality claim. Those are weaker protections: they require the information to remain secret, they do not survive disclosure, and they cannot be enforced against independent creation of the same code. Most IP assignment clauses in employment contracts were not drafted with this scenario in mind and may be claiming rights that do not legally exist.

conartist6•about 8 hours ago

correct

croes•about 8 hours ago

How is it for human developers now if the company tool is a cloud tool and not running on company servers?

mensetmanusman•about 8 hours ago

It’s the same as photography. No photographer built the multibillion dollar supply chain for the optics train in a camera, nor did they build the city scape they are enjoying as a background, they simply set the stage and push a button.

whattheheckheck•about 1 hour ago

Ask ChatGPT Deep Research to cite court cases and it shows that "dark factory" SWE code is not copyrightable under current precedents.

Even steering it with prompts isn't enough. The guy couldn't copyright the image he made with AI; code is no different.

Maybe prompts written by humans are copyrightable.

Can't wait for the Billionaires to entrench in court they can steal everything for these machines and claim it as their own and maybe even reach for anything that it helps produce. Fuck that

DeathArrow•about 8 hours ago

I have a wood cutting machine and some wood. Who owns the timber?

bell-cot•about 8 hours ago

Sadly, IP "ownership" and copyright law are vastly more complex than ownership of physical stuff.

Or were you planning to reproduce the (say) Ford Motor Company's trademarked symbol in wood? If so, you're right back in the stinkin' swamp.

croes•about 8 hours ago

What is the wood in your example?

This is like a machine you ask for timber: you get timber, but you didn’t need to provide any wood.