how can I repair defective URL metadata for Lemmy posts?
-
I've been using Lemmy since last August and I find it very useful for sharing articles.
Most of the time articles are very easy to upload with no real defects.
However, there are a few websites, such as apnewsDOTcom that send data that ends up as giving "Just a moment" for the headline, and no thumbnail image. [see https://lemmy.ml/post/26652484]Where would I look in the apnews webpages to find the data elements that need fixing, and what software would I need to enter the fixed data into Lemmy, so that the Lemmy post looks right?
Thanks in advance for any help I get for this.
-
-
I've been using Lemmy since last August and I find it very useful for sharing articles.
Most of the time articles are very easy to upload with no real defects.
However, there are a few websites, such as apnewsDOTcom that send data that ends up as giving "Just a moment" for the headline, and no thumbnail image. [see https://lemmy.ml/post/26652484]Where would I look in the apnews webpages to find the data elements that need fixing, and what software would I need to enter the fixed data into Lemmy, so that the Lemmy post looks right?
Thanks in advance for any help I get for this.
"Just a moment" is probably the anti bot/anti DDOS service that particular website uses kicking in. A real website would interact with the web page and redirect before most people would notice, but bots and scrapers grab the lightweight HTML and run with it.
There is no good solution for this. If you're the instance admin you could configure something like Flaresolverr to bypass bot protection pages, but you'd still need to update Lemmy to not grab the first page requested and wait for the redirect cycle to complete first.
As a user you can try picking links to sources that don't have this type of bot protection built in, or you could link to the AMP page which is usually cached on Google's servers without an anti bot system active.
-
I've been using Lemmy since last August and I find it very useful for sharing articles.
Most of the time articles are very easy to upload with no real defects.
However, there are a few websites, such as apnewsDOTcom that send data that ends up as giving "Just a moment" for the headline, and no thumbnail image. [see https://lemmy.ml/post/26652484]Where would I look in the apnews webpages to find the data elements that need fixing, and what software would I need to enter the fixed data into Lemmy, so that the Lemmy post looks right?
Thanks in advance for any help I get for this.
I don’t have any advice other than 1) read community rules and 2) post Lemmy support questions to [email protected] in the future.
-
I don’t have any advice other than 1) read community rules and 2) post Lemmy support questions to [email protected] in the future.
I really didn't consider it a support question since it isn't a problem with Lemmy, but with another website. But apparently that's not the consensus here.
-
"Just a moment" is probably the anti bot/anti DDOS service that particular website uses kicking in. A real website would interact with the web page and redirect before most people would notice, but bots and scrapers grab the lightweight HTML and run with it.
There is no good solution for this. If you're the instance admin you could configure something like Flaresolverr to bypass bot protection pages, but you'd still need to update Lemmy to not grab the first page requested and wait for the redirect cycle to complete first.
As a user you can try picking links to sources that don't have this type of bot protection built in, or you could link to the AMP page which is usually cached on Google's servers without an anti bot system active.
Thanks for the feedback. I'll look into checking for the AMP pages. (Never heard of those before, haven't done much with the web for quite a while.)
-
I don’t have any advice other than 1) read community rules and 2) post Lemmy support questions to [email protected] in the future.
After reading further, I see your point. Sorry about that.
-
System shared this topic onSystem shared this topic on