weird image issue stalls sitemap crawler

soaringeagle
@soaringeagle
5 years ago
3,304 posts
Haven't been able to run a sitemap crawl in about a month.
I just noticed that one of the images it stalled on has an error (no timeouts, no 5xx errors, no log entries),
but https://www.dreadlockssite.com/gallery/image/gallery-image/86547/1820
see the error message on the page.
I'm a little scared to clear the image cache and have more act up then.


--
soaringeagle
head dreadhead at dreadlocks site
glider pilot student and member/volunteer coordinator with freedoms wings international soaring for people with disabilities

updated by @soaringeagle: 08/05/19 07:05:57PM
douglas
@douglas
5 years ago
2,767 posts
That is the wrong URL for that image: it is missing the image size, which is what the error is telling you. It should be something like this:

https://www.dreadlockssite.com/gallery/image/gallery-image/86547/original

So somewhere in your templates you are creating links to images that are incorrect.
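
To find which links are affected, a small check along these lines could be run over the URLs your crawler collects. This is only a rough sketch: the `ASSUMED_SIZES` set and both function names are made up for illustration, and the size names other than `original` are placeholders, not a list of what the image module actually accepts.

```python
from urllib.parse import urlparse

# Assumption: placeholder size names. Only "original" comes from the example
# URL above; replace this set with the sizes your install really accepts.
ASSUMED_SIZES = {"original", "small", "medium", "large"}


def last_path_segment(url):
    """Return the final path segment of a URL, ignoring query string and trailing slash."""
    path = urlparse(url).path.rstrip("/")
    return path.rsplit("/", 1)[-1]


def has_known_size(url, sizes=ASSUMED_SIZES):
    """True if the URL ends in one of the assumed size names."""
    return last_path_segment(url) in sizes


if __name__ == "__main__":
    for url in (
        "https://www.dreadlockssite.com/gallery/image/gallery-image/86547/1820",
        "https://www.dreadlockssite.com/gallery/image/gallery-image/86547/original",
    ):
        print(url, "->", "ok" if has_known_size(url) else "size missing or unknown")
```

Any URL it flags can then be traced back to the template that generated the link.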

Hope this helps!


--

Douglas Hackney
Jamroom Team - Designer/Developer/Support
FAQ-Docs-Help Videos
soaringeagle
@soaringeagle
5 years ago
3,304 posts
There was a v= something that I couldn't see all of, which I thought was a cache validator or something, so I didn't include it.
But the crawler stops on random images regardless.


soaringeagle
@soaringeagle
5 years ago
3,304 posts
This started suddenly, with no template changes, but maybe an update to a module (and I do have beta enabled).
I'll try again. It typically fails between 150k and 350k pages crawled (as far as it's gotten), sometimes as low as 30k (out of roughly 980k).


soaringeagle
@soaringeagle
5 years ago
3,304 posts
OK, here again.
This was never doing this prior to several weeks ago.
Is the image or gallery module beta?
If so, it's a new bug.
crawlstall.jpg  •  90KB




brian
@brian
5 years ago
10,136 posts
It's trying to DOWNLOAD the gallery image - that's not a gallery view. I'm not sure what to tell you here without an actual error to report.


--
Brian Johnson
Founder and Lead Developer - Jamroom
https://www.jamroom.net
soaringeagle
@soaringeagle
5 years ago
3,304 posts
Well, the crawler just follows every link,
and like I said, it worked until recently.
The last example wasn't a download link. I'll keep running it and show you where it stalls; it's always on an image, though.


brian
@brian
5 years ago
10,136 posts
Images pretty much work, so we'll need either an actual ERROR from the image URL or an entry in your logs/error_log. Otherwise it just sounds like the crawler can't handle a bad response.
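
One way to capture that kind of evidence is to request a stalled URL by itself, with a hard timeout, and record what comes back. The following is a minimal sketch using only the Python standard library. The `probe` function and its `opener` parameter are hypothetical names for this example (the `opener` hook is only there so it can be exercised without a network), and it says nothing about how the sitemap crawler itself is implemented.

```python
import socket
import urllib.error
import urllib.request


def probe(url, timeout=10, opener=urllib.request.urlopen):
    """Fetch url once and return (outcome, detail) instead of hanging or raising."""
    try:
        with opener(url, timeout=timeout) as resp:
            ctype = resp.headers.get("Content-Type", "")
            return ("ok", f"{resp.status} {ctype}")
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (4xx/5xx).
        return ("http_error", str(err.code))
    except (urllib.error.URLError, socket.timeout, TimeoutError) as err:
        # No usable answer: DNS failure, refused connection, or timeout.
        return ("no_response", repr(err))


if __name__ == "__main__":
    # Replace with the exact URL the crawler stalled on.
    print(probe("https://www.dreadlockssite.com/gallery/image/gallery-image/86547/original"))
```

An `http_error` or `no_response` result, together with the timestamp, gives something concrete to match against logs/error_log.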


soaringeagle
@soaringeagle
5 years ago
3,304 posts
It happens very randomly, and I'm thinking it's related to these other issues with the beta/new core/followers/other updates that are causing multiple odd issues,
including profiles missing from lists, forum lists showing up weirdly, and one of my most important profiles no longer behaving like a profile or having an index, though it has a forum.
It's really random and confusing,

but I think it all has to be related.


