mesa@piefed.social to Technology@lemmy.worldEnglish · 1 day agoTesla said it didn’t have key data in a fatal crash. Then a hacker found it.www.washingtonpost.comexternal-linkmessage-square25fedilinkarrow-up1514arrow-down11file-textcross-posted to: technology@beehaw.org
arrow-up1513arrow-down1external-linkTesla said it didn’t have key data in a fatal crash. Then a hacker found it.www.washingtonpost.commesa@piefed.social to Technology@lemmy.worldEnglish · 1 day agomessage-square25fedilinkfile-textcross-posted to: technology@beehaw.org
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up6·edit-27 hours agoHow does archive get the unpaywalled version? I don’t think they pay the subscription for every single tabloid out there? Asking for a friend.
minus-squarestoly@lemmy.worldlinkfedilinkEnglisharrow-up3·2 hours agoThe paywall is JavaScript but the content is still in plaintext below. The crawlers don’t read the JavaScript.
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up4·2 hours agoDisabling 3rd-party js has no paywall, but only the first paragraph too. Crawlers get full access?
minus-squareAnarchistArtificer@slrpnk.netlinkfedilinkEnglisharrow-up3·6 hours agoI think they use the same thing that web crawlers use. If Google’s crawler couldn’t access the content of the page (or could only access a limited amount of content), it would likely rank far lower in search results
minus-squareMonkderVierte@lemmy.ziplinkfedilinkEnglisharrow-up2·edit-22 hours agoBtw, how come there is no search engine where you can sort and filter how you want instead of how they want? (except self-hosted i mean) Pornhub has better searchability than, uh, all search sites i know.
How does archive get the unpaywalled version? I don’t think they pay the subscription for every single tabloid out there?
Asking for a friend.
The paywall is JavaScript but the content is still in plaintext below. The crawlers don’t read the JavaScript.
Disabling 3rd-party js has no paywall, but only the first paragraph too. Crawlers get full access?
I think they use the same thing that web crawlers use. If Google’s crawler couldn’t access the content of the page (or could only access a limited amount of content), it would likely rank far lower in search results
Btw, how come there is no search engine where you can sort and filter how you want instead of how they want? (except self-hosted i mean)
Pornhub has better searchability than, uh, all search sites i know.