Dan Romero pfp
Dan Romero
@dwr
Wonder if ChatGPT will be the last major model to be trained on the open web? robots.txt specifically disallowing crawling from LLMs unless getting paid for the data?
9 replies
0 recast
0 reaction

William Saar pfp
William Saar
@saarw
If AIs can generate enough value, it might be worth paying armies of Mechanical Turk-style workers to manually visit and rewrite web sites for copyright-approved training Facts and ideas can't be copyrighted, only particular expression
1 reply
0 recast
0 reaction

Travis A. Everett pfp
Travis A. Everett
@abathur
This would double down on the risk LLMs are laundering misinfo, no? I also expect avoiding liability is more complex than just paying someone with a pulse to thesaurus words. I think a jury would agree the work was copied if there's deep, undeniable structural similarity.
1 reply
0 recast
0 reaction