GitHub accused of varying Copilot output to avoid copyright allegations

GitHub is alleged to have tuned its Copilot programming assistant to generate slight variations of ingested training code to prevent output from being flagged as a direct copy of licensed software.

This assertion appeared on Thursday in the amended complaint against Microsoft, GitHub, and OpenAI over Copilot’s documented penchant for reproducing developers’ publicly posted, open source-licensed code.

The lawsuit, initially filed last November on behalf of four unidentified (“J. Doe”) plaintiffs, claims that Copilot – a code suggestion tool built from OpenAI’s Codex model and commercialized by Microsoft’s GitHub – was trained on publicly posted code in a way that violates copyright law and software licensing requirements and that it presents other people’s code as its own.

Microsoft, GitHub, and OpenAI tried to have the case dismissed, but managed only to shake off some of the claims. The judge left intact the major copyright and licensing issues, and allowed the plaintiffs to refile several other claims with more details.

The amended complaint – now covering eight counts instead of twelve – retains accusations of violating the Digital Millennium Copyright Act, breach of contract (open source license violations), unjust enrichment, and unfair competition.

It adds several other allegations in place of those sent back for revision: breach of contract (selling licensed materials in violation of GitHub’s policies), intentional interference with prospective economic relations, and negligent interference with prospective economic relations.

The revised complaint adds one more “J. Doe” plaintiff whose code Copilot has allegedly reproduced. It also includes sample code written by the plaintiffs that Copilot has supposedly reproduced verbatim, though only for the court: the code samples have been redacted to prevent the plaintiffs from being identified.

The judge overseeing the case has permitted the plaintiffs to remain anonymous in court filings because of credible threats of violence directed at their attorney. The Register understands that the plaintiffs are known to the defendants.

A cunning plan?

Thursday’s legal filing says that in July 2022, in response to public criticism of Copilot, GitHub introduced a user-adjustable Copilot filter called “Suggestions matching public code” to avoid seeing software suggestions that duplicate other people’s work.

“When the filter is enabled, GitHub Copilot checks code suggestions with their surrounding code of about 150 characters against public code on GitHub,” GitHub’s documentation explains. “If there is a match or near match, the suggestion will not be shown to you.”

However, the complaint contends the filter is essentially worthless because it only checks for exact matches and does nothing to detect output that has been slightly modified. In fact, the plaintiffs suggest that GitHub is trying to get away with copyright and license violations by varying Copilot’s output so that it doesn’t appear to have been copied exactly.

“In GitHub’s hands, the propensity for small cosmetic variations in Copilot’s Output is a feature, not a bug,” the amended complaint says. “These small cosmetic variations mean that GitHub can deliver to Copilot customers unlimited modified copies of Licensed Materials without ever triggering Copilot’s verbatim-code filter.”
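The distinction the complaint draws, between a verbatim check and one that also catches cosmetic variations, can be illustrated with a minimal sketch. This is not GitHub’s filter: the snippet index, the function names, and the 0.8 similarity threshold are all hypothetical, chosen only to show how a renamed variable slips past an exact-match test while a similarity test still flags it.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for an index of public code; not GitHub's data.
PUBLIC_SNIPPETS = [
    "def is_even(n):\n    return n % 2 == 0\n",
]


def exact_match(suggestion: str) -> bool:
    """True if the suggestion appears verbatim in any indexed snippet."""
    return any(suggestion in snippet for snippet in PUBLIC_SNIPPETS)


def near_match(suggestion: str, threshold: float = 0.8) -> bool:
    """True if the suggestion is textually similar to any indexed snippet.

    The threshold is an illustrative assumption, not a documented value.
    """
    return any(
        SequenceMatcher(None, suggestion, snippet).ratio() >= threshold
        for snippet in PUBLIC_SNIPPETS
    )


verbatim = "def is_even(n):\n    return n % 2 == 0\n"
renamed = "def is_even(x):\n    return x % 2 == 0\n"  # one variable renamed

print(exact_match(verbatim), near_match(verbatim))
print(exact_match(renamed), near_match(renamed))
```

In this toy setup, the renamed version fails the exact-match check but passes the similarity check, which is the gap the plaintiffs say Copilot’s output falls into.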

The court filing points out that machine learning models like Copilot have a parameter that controls the extent to which output varies.

“On information and belief, GitHub has optimized the temperature setting of Copilot to produce small cosmetic variations of the Licensed Materials as often as possible, so that GitHub can deliver code to Copilot users that works the same way as verbatim code, while claiming that Copilot only produces verbatim code one percent of the time,” the amended complaint says. “Copilot is an ingenious method of software piracy.”
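For readers unfamiliar with the term, temperature in language models generally scales a model’s raw token scores before they are turned into probabilities. The sketch below shows that general technique only; the scores and temperature values are made up, and nothing here reflects how Copilot is actually configured.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert raw scores to probabilities, scaled by a temperature.

    Lower temperatures concentrate probability on the top-scoring token;
    higher temperatures spread it across the alternatives.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate next tokens.
logits = [2.0, 1.0, 0.5]

low = softmax_with_temperature(logits, 0.2)
high = softmax_with_temperature(logits, 2.0)

print([round(p, 3) for p in low])
print([round(p, 3) for p in high])
```

At the low setting, nearly all of the probability lands on the top candidate, so output tends to repeat the most likely continuation; at the high setting, the alternatives become far more likely to be picked, which is the kind of variation the complaint refers to.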

In an email, Microsoft’s GitHub insisted otherwise.

“We firmly believe AI will transform the way the world builds software, leading to increased productivity and most importantly, happier developers,” a company spokesperson told The Register. “We are confident that Copilot adheres to applicable laws and we’ve been committed to innovating responsibly with Copilot from the start. We will continue to invest in and advocate for the AI-powered developer experience of the future.”

OpenAI did not respond to a request for comment. ®
