GitHub accused of varying Copilot output to avoid copyright allegations

GitHub is alleged to have tuned its Copilot programming assistant to generate slight variations of ingested training code to prevent output from being flagged as a direct copy of licensed software.

This assertion appeared on Thursday in the amended complaint [PDF] against Microsoft, GitHub, and OpenAI over Copilot’s documented penchant for reproducing developers’ publicly posted, open source licensed code.

The lawsuit, initially filed last November on behalf of four unidentified (“J. Doe”) plaintiffs, claims that Copilot – a code suggestion tool built from OpenAI’s Codex model and commercialized by Microsoft’s GitHub – was trained on publicly posted code in a way that violates copyright law and software licensing requirements and that it presents other people’s code as its own.

Microsoft, GitHub, and OpenAI tried to have the case dismissed, but managed only to shake off some of the claims. The judge left intact the major copyright and licensing issues, and allowed the plaintiffs to refile several other claims with more details.

The amended complaint – now covering eight counts instead of twelve – retains accusations of violating the Digital Millennium Copyright Act, breach of contract (open source license violations), unjust enrichment, and unfair competition.

It adds several other allegations in place of those sent back for revision: breach of contract (selling licensed materials in violation of GitHub’s policies), intentional interference with prospective economic relations, and negligent interference with prospective economic relations.

The revised complaint adds one additional “J. Doe” plaintiff whose code Copilot has allegedly reproduced. And it includes sample code written by the plaintiffs that Copilot has supposedly reproduced verbatim, although only for the court – the code samples have been redacted in order to prevent the plaintiffs from being identified.

The judge overseeing the case has permitted the plaintiffs to remain anonymous in court filings because of credible threats of violence [PDF] directed at their attorney. The Register understands that the plaintiffs are known to the defendants.

A cunning plan?

Thursday’s legal filing says that in July 2022, in response to public criticism of Copilot, GitHub introduced a user-adjustable Copilot filter called “Suggestions matching public code” that lets users avoid seeing software suggestions that duplicate other people’s work.

“When the filter is enabled, GitHub Copilot checks code suggestions with their surrounding code of about 150 characters against public code on GitHub,” GitHub’s documentation explains. “If there is a match or near match, the suggestion will not be shown to you.”

However, the complaint contends the filter is essentially worthless because it only checks for exact matches and does nothing to detect output that has been slightly modified. In fact, the plaintiffs suggest that GitHub is trying to get away with copyright and license violations by varying Copilot’s output so that it doesn’t appear to have been copied exactly.

“In GitHub’s hands, the propensity for small cosmetic variations in Copilot’s Output is a feature, not a bug,” the amended complaint says. “These small cosmetic variations mean that GitHub can deliver to Copilot customers unlimited modified copies of Licensed Materials without ever triggering Copilot’s verbatim-code filter.”
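The plaintiffs' argument can be illustrated with a toy example. The sketch below is purely hypothetical: `verbatim_filter`, the sample corpus, and the snippets are invented for illustration and do not represent GitHub's actual filter, which its documentation says also catches near matches. It shows only why a check limited to exact matches, as the complaint describes it, would miss a suggestion whose variable names have been changed.

```python
def verbatim_filter(suggestion: str, public_corpus: list[str]) -> bool:
    """Return True if the suggestion appears character-for-character in the corpus.

    Hypothetical exact-match check, not GitHub's implementation.
    """
    return any(suggestion in document for document in public_corpus)


# Invented stand-in for publicly posted, licensed code.
public_code = ["def add(a, b):\n    return a + b\n"]

# An exact copy, and a copy with only the parameter names changed.
verbatim_copy = "def add(a, b):\n    return a + b\n"
renamed_copy = "def add(x, y):\n    return x + y\n"

blocked_verbatim = verbatim_filter(verbatim_copy, public_code)  # exact copy is caught
blocked_renamed = verbatim_filter(renamed_copy, public_code)    # renamed copy slips through
```

Both snippets behave identically when run, yet only the first is flagged, which is the gap the complaint alleges.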

The court filing points out that machine learning models like Copilot have a parameter that controls the extent to which output varies.
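That parameter is commonly called temperature. As a rough sketch of the general technique, and not of Copilot's actual configuration, the example below divides a model's raw scores by the temperature before converting them to probabilities. The scores and the function name `softmax_with_temperature` are made up for illustration.

```python
import math


def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Turn raw scores into probabilities, scaled by a sampling temperature."""
    scaled = [score / temperature for score in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in scaled]
    total = sum(exps)
    return [value / total for value in exps]


# Invented scores for three candidate next tokens.
logits = [2.0, 1.0, 0.5]

low_temperature = softmax_with_temperature(logits, 0.2)   # concentrates on the top token
high_temperature = softmax_with_temperature(logits, 2.0)  # spreads probability more evenly
```

At a low temperature almost all of the probability lands on the highest-scoring token, so output is close to deterministic; at a higher temperature the alternatives become more likely and output varies more from one run to the next.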

“On information and belief, GitHub has optimized the temperature setting of Copilot to produce small cosmetic variations of the Licensed Materials as often as possible, so that GitHub can deliver code to Copilot users that works the same way as verbatim code, while claiming that Copilot only produces verbatim code one percent of the time,” the amended complaint says. “Copilot is an ingenious method of software piracy.”

Microsoft’s GitHub in an email insisted otherwise.

“We firmly believe AI will transform the way the world builds software, leading to increased productivity and most importantly, happier developers,” a company spokesperson told The Register. “We are confident that Copilot adheres to applicable laws and we’ve been committed to innovating responsibly with Copilot from the start. We will continue to invest in and advocate for the AI-powered developer experience of the future.”

OpenAI did not respond to a request for comment. ®
