FiveA to Reddit@lemmy.worldEnglish · 1 month agoReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgexternal-linkmessage-square42fedilinkarrow-up1299arrow-down18cross-posted to: becomeme@sh.itjust.worksreddit@lemmy.worldquitterreddit@jlai.lubyereddit@lemmy.worldtechnology@lemmy.worldluddite@lemmy.ml
arrow-up1291arrow-down1external-linkReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgFiveA to Reddit@lemmy.worldEnglish · 1 month agomessage-square42fedilinkcross-posted to: becomeme@sh.itjust.worksreddit@lemmy.worldquitterreddit@jlai.lubyereddit@lemmy.worldtechnology@lemmy.worldluddite@lemmy.ml
minus-squareabbadon420@lemm.eelinkfedilinkarrow-up4·1 month agoWhere can I find those archive dumps? The usual (unmentionable) torrent sites or is there a specific place for archive dumps?
minus-squareFaceDeer@fedia.iolinkfedilinkarrow-up4arrow-down2·edit-21 month agoThe place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.
Where can I find those archive dumps? The usual (unmentionable) torrent sites or is there a specific place for archive dumps?
The place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.