This January, Byeongjun Park, a researcher in synthetic intelligence (AI), acquired a stunning e-mail. Two researchers from India informed him that an AI-generated manuscript had used strategies from one in every of his papers, with out credit score.
Park appeared up the manuscript. It wasn’t formally revealed, however had been posted on-line (see go.nature.com/45pdgqb) as one in every of quite a few papers generated by a instrument referred to as The AI Scientist — introduced in 2024 by researchers at Sakana AI, an organization in Tokyo1.
The AI Scientist is an instance of absolutely automated analysis in laptop science. The instrument makes use of a big language mannequin (LLM) to generate concepts, writes and runs the code by itself, after which writes up the outcomes as a analysis paper — clearly marked as AI-generated. It’s the beginning of an effort to have AI programs make their very own analysis discoveries, says the workforce behind it.
The AI-generated work wasn’t copying his paper instantly, Park noticed. It proposed a brand new structure for diffusion fashions, the kinds of mannequin behind image-generating instruments. Park’s paper handled bettering how these fashions are educated2. However to his eyes, the 2 did share comparable strategies. “I used to be stunned by how intently the core methodology resembled that of my paper,” says Park, who works on the Korea Superior Institute of Science and Know-how (KAIST) in Daejeon, South Korea.
Researchers constructed an ‘AI Scientist’ — what can it do?
The researchers who e-mailed Park, Tarun Gupta and Danish Pruthi, are laptop scientists on the Indian Institute of Science in Bengaluru. They are saying that the difficulty is greater than simply his paper.
In February, Gupta and Pruthi reported3 that they’d discovered a number of examples of AI-generated manuscripts that, in response to exterior consultants they consulted, used others’ concepts with out attribution, though with out instantly copying phrases and sentences.
Gupta and Pruthi say that this quantities to the software program instruments plagiarizing different concepts — albeit with no in poor health intention on the a part of their creators. “A good portion of LLM-generated analysis concepts seem novel on the floor however are literally skillfully plagiarized in ways in which make their originality troublesome to confirm,” they write.
In July, their work gained an ‘excellent paper’ award on the Affiliation for Computational Linguistics convention in Vienna.
However a few of their findings are disputed. The workforce behind The AI Scientist informed Nature that it strongly disagrees with Gupta and Pruthi’s findings, and doesn’t settle for that any plagiarism occurred in The AI Scientist case research that the paper examines. In Park’s particular case, one unbiased specialist informed Nature that he thought the AI manuscript’s strategies didn’t overlap sufficient with Park’s paper to be termed plagiarism. Park himself additionally demurred at utilizing ‘plagiarism’ to explain what he noticed as a powerful methodological overlap.
Past the precise debate about The AI Scientist lies a broader concern. So many papers are revealed annually — particularly in laptop science — that researchers already battle to maintain observe of whether or not their concepts are actually progressive, says Joeran Beel, a specialist in machine-learning and knowledge science on the College of Siegen, Germany.
And if extra LLM-based instruments are used to generate concepts, this might deepen the erosion of mental credit score in science. As a result of LLMs work partially by remixing and interpolating the textual content they’re educated on, it could be pure for them to borrow from earlier work, says Parshin Shojaee, a pc scientist on the Virginia Tech Analysis Heart — Arlington.
The difficulty of ‘concept plagiarism’, though little mentioned, is already an issue with human-authored papers, says Debora Weber-Wulff, a plagiarism researcher on the College of Utilized Sciences, Berlin, and she or he expects that it’s going to worsen with work created by AI. However, not like the extra acquainted types of plagiarism — involving copied or subtly rewritten sentences — it’s laborious to show the reuse of concepts, she says.
That makes it troublesome to see the best way to automate the duty of checking for true novelty or originality, to match the tempo at which AIs are going to have the ability to synthesize manuscripts.
“There’s nobody solution to show concept plagiarism,” Weber-Wulff says.
Overlapping strategies
Dangerous actors can, in fact, already use AI to intentionally plagiarize others or rewrite others’ work to move it off as their very own (see Nature https://doi.org/gt5rjz; 2025). However Gupta and Pruthi puzzled if well-intentioned AI approaches is likely to be utilizing others’ strategies or concepts too.
Gupta and Pruthi have been first alerted to the difficulty after they learn a 2024 research led by Chenglei Si, a pc scientist at Stanford College in California4. Si’s workforce requested each folks and LLMs to generate “novel analysis concepts” on matters in laptop science. Though Si’s protocol included a novelty verify and requested human reviewers to evaluate the concepts, Gupta and Pruthi argue that a number of the AI-generated concepts produced by the protocol however lifted from current works — and so weren’t ‘novel’ in any respect.
They picked out one of many AI-generated concepts in Si’s paper, which they are saying borrowed from a paper first posted as a preprint5 in 2023. Si tells Nature that he agrees that the ‘high-level’ concept was much like materials within the preprint, however that “whether or not the low-level implementation variations depend as novelty might be a subjective judgement”. Shubhendu Trivedi, a machine-learning researcher who co-authored that 2023 preprint, and was till just lately on the Massachusetts Institute of Know-how in Cambridge, says that “the LLM-generated paper was mainly similar to our paper, regardless of some superficial-level variations”.

AI is complicating plagiarism. How ought to scientists reply?
Gupta and Pruthi additional examined their concern by taking the 4 AI-generated analysis proposals publicly launched by Si’s workforce and the ten AI manuscripts launched by Sakana AI, and generated 36 contemporary proposals themselves, utilizing Si’s methodology. They then requested 13 specialists to attempt to discover overlaps in strategies between the AI-made works and current papers, utilizing a 5-point scale, on which 5 corresponded to a ‘one-to-one mapping in strategies’ and 4 to ‘mix-and-match from two-to-three prior works’; 3 and a couple of represented more-modest overlaps and 1 indicated no overlap. “It’s primarily about copying of the concept or crux of the paper,” says Gupta.
The researchers additionally requested the authors of authentic papers recognized by the specialists to present their very own views on the overlaps.
Together with this step, Gupta and Pruthi report that 12 papers of their pattern of AI-generated works reached ranges 4 and 5, implying, they mentioned, a plagiarism proportion of 24%; the determine rises to 18 (36%) if instances through which the unique authors didn’t reply are included. Some have been from Sakana’s and Si’s work, though Gupta and Pruthi focus on intimately solely the examples reported on this story.
Additionally they mentioned they’d discovered the same form of overlap in an AI-generated manuscript (see go.nature.com/4oym4ru) that, Sakana introduced this March, had handed via a stage of peer overview for a workshop at a prestigious machine-learning convention, the Worldwide Convention on Studying Representations.
On the time, the agency mentioned that this was the primary fully-AI-generated paper to move human peer overview. It additionally defined that it had agreed with workshop organizers to trial placing AI-generated papers into peer overview and to withdraw them in the event that they have been accepted, as a result of the group hadn’t but determined whether or not AI-generated papers needs to be revealed in convention proceedings. (The workshop organizers declined Nature’s request for remark.)
Gupta and Pruthi say that this paper borrowed its core contribution from a 2015 work6, with out citing it. Their report quotes the authors of that paper, laptop scientists David Krueger and Roland Memisevic, as saying that the Sakana work is “definitively not novel”, and figuring out a second uncited manuscript7 that the paper borrowed from.
One other laptop scientist, Radu Ionescu on the College of Bucharest, informed Nature he rated the similarity between the AI-generated work and Krueger and Memisevic’s paper as a 5.
Krueger, who’s on the College of Montreal in Canada, informed Nature that the associated works ought to have been cited, however that he “wouldn’t be stunned to see human researchers reinvent this and miss earlier work” too. “I believe this AI system and others aren’t able to attaining tutorial requirements for referencing associated work,” he mentioned, including that the AI paper was “extraordinarily low high quality total”. However he wasn’t certain whether or not the phrase plagiarism needs to be utilized, as a result of he feels that time period implies that the individual (or AI instrument) reusing strategies was conscious of earlier work, however selected to not cite it.
Pushback
The workforce behind The AI Scientist, which incorporates researchers on the College of Oxford, UK, and the College of British Columbia in Vancouver, Canada, pushed again strongly in opposition to Gupta and Pruthi’s work when requested by Nature. “The plagiarism claims are false,” the workforce wrote in an e-mailed point-by-point critique, including that they have been “unfounded, inaccurate, excessive, and needs to be ignored”.
On two AI Scientist manuscripts mentioned in Gupta and Pruthi’s paper, for example, the workforce says that these works have completely different hypotheses from these within the earlier papers and apply them to completely different domains, even when some parts of the strategies are associated.

Is it OK for AI to put in writing science papers? Nature survey reveals researchers are cut up
The references discovered by the specialists for Gupta and Pruthi’s evaluation are work that the AI-generated papers may have cited, however nothing extra, the AI Scientist workforce says, including: “What they need to have reported is a few associated work that went uncited (a every day prevalence by human authors).” The workforce says it could be “acceptable” to have cited Park’s paper. Within the case of Krueger’s paper and the second uncited manuscript, the AI Scientist workforce says, “these two papers are associated, so, whereas it’s an on a regular basis prevalence by people to not embrace works like this, it could have been good for The AI Scientist to quote them”.
Ben Hoover, a machine-learning researcher on the Georgia Institute of Know-how in Atlanta who makes a speciality of diffusion fashions, informed Nature that he’d rating the overlap with Park’s paper as a ‘3’ on Gupta’s scale. He mentioned the AI-generated paper is of a lot decrease high quality and fewer thorough than Park’s work, and will have cited it, however “I might not go as far as to say plagiarism.” Gupta and Pruthi’s evaluation depends on ‘superficial similarities’ between generic statements within the AI-generated work that, when learn intimately, don’t meaningfully map to Park’s paper, he provides. Ionescu informed Nature he would give the AI-generated paper a score of two or 3.
Park judges the overlap together with his paper to be a lot stronger than Hoover’s and Ionescu’s rankings. He says he would give it a rating of 5 on Gupta’s scale, and provides that it “displays a powerful methodological resemblance that I take into account noteworthy.” Even so, this doesn’t essentially align with what he sees because the authorized or moral definition of plagiarism, he informed Nature.
What counts as plagiarism
A part of the disagreement may stem from completely different operational understandings of what ‘plagiarism’ means, particularly relating to overlap in concepts or strategies. Researchers who research plagiarism maintain completely different views on the time period from these of a number of the laptop scientists within the present debate, says Weber-Wulff.
“Plagiarism is a phrase we should always and do reserve for excessive instances of intentional fraudulent dishonest,” the AI Scientist workforce wrote, including that Gupta and Pruthi “are wildly out of line with established conventions concerning what counts as plagiarism in academia”. However Weber-Wulff disagrees: she says that intent shouldn’t be an element. “The machine has no intent,” she says. “We don’t have mechanism for explaining why the system is saying one thing and the place it acquired it from, as a result of these programs aren’t constructed to present references.”
Weber-Wulff’s personal favoured definition of plagiarism is that it happens when a manuscript “makes use of phrases, concepts, or work merchandise attributable to a different identifiable individual or supply with out correctly attributing the work to the supply from which it was obtained in a scenario in which there’s a official expectation of authentic authorship”. That definition was produced by Teddi Fishman, the previous director of a US non-profit consortium of universities referred to as the Worldwide Heart for Tutorial Integrity.