Forum Activity for @soaringeagle

soaringeagle
@soaringeagle
01/30/15 03:50:19PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

ok so can you come up with a good way to do this/ ill cancle this
soaringeagle
@soaringeagle
01/30/15 03:19:17PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

ps when this finaly finishes i will put the results in afile and attach but what file types can i attach
soaringeagle
@soaringeagle
01/30/15 03:18:11PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

brian:
Strumelia:
Adding to Soaring's question:
since these paired images have different file number IDs in our ning folders (see my previous screenshot), will Paul's delete duplicates Tool recognize and delete the dupes even though they have diff # ids?

That's a good question. Looking at the code it looks like we are checking for duplicate ID's, which would not help here. Do the images have anything in common that we can key on? I don't have a Ning Archive to check it out.

Thanks!

im running a file size and hash compare its taking hours

but thats what i think will be needed is to do a file size then mdr hash test

im using the 1st code from here
http://www.commandlinefu.com/commands/view/3555/find-duplicate-files-based-on-size-first-then-md5-hash

but that is basicly what needs to be run and i think it should be un as a maintenence cycle in the quue as its kinda cpu intensive (not severely)

buti would recomend that before deletion it should also check for a custom form designer feild *_featured

if i have to refeature em all so be it
but if it specificly chodse the version thats not featured when 2 versions are found that would be best
soaringeagle
@soaringeagle
01/30/15 02:35:10PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

oh i was gonna ssk and forgot
if i did do a re-upload re-import with delete doesnt that just trunkate the database not remove the files

so that wont help either huh
soaringeagle
@soaringeagle
01/30/15 02:32:52PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

its a nightmare impaying like300 a month for the server and thinking ill have to addanother 100+ for another drive before theres any significant trafic

speaking of that can u please take a look at my titles and meta suggestion for sitebuilder and up that priority a notch just cause i really desperatekly need to take care of the seo asap or ill be broke very soon

though removing the excess wasted space is also up there on priorites list

if i couldremove 100 gb of excess photos that would be huge


f'ing ning
from now on gonna call em fing
i know if i call em and explain the issue they will say we will look into it but never get a thing done
soaringeagle
@soaringeagle
01/30/15 01:18:13PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

if i was to do a delete and re-import it would
but once there ...

im running

find -not -empty -type f -printf "%s\n" | sort -rn | uniq -d | xargs -I{} -n1 find -type f -size {}c -print0 | xargs -0 md5sum | sort | uniq -w32 --all-repeated=separate

to find alist of duplicates
soaringeagle
@soaringeagle
01/30/15 01:09:08PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

same names diferent numbers 80859 80860
soaringeagle
@soaringeagle
01/30/15 01:08:11PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

paul:
So to be clear, you have run the Delete Media tool and you are looking at what's left?

yes
its like here duplicates

https://www.dreadlockssite.com/graciela-valderrama/gallery/80859/1-photo-by-mauricio-gomez-amoretti-12-08-14#gallery_img
soaringeagle
@soaringeagle
01/30/15 01:07:24PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

actualy inspect element revealed that those weird urls arent coming from the photos they are sequentiial so no way to bulk delete argh
soaringeagle
@soaringeagle
01/30/15 12:50:32PM
3,304 posts

after final import any hope for duplicate photo deletes


Ning To Jamroom

there are 2 of nearly all gallery images not all but most
and ive aleready done the final import so the fixed importer to trap them is no help asthey are already there

because it appears they are all in a secondary numbered folder is there any ssh trickery that can delete all directories in the extrsa numbered directories like find gallery/*numbers* delete /*numbers

anything that will delete all those extras

if thats even where those search errors are comingin in my sitremap crawler im also getting thousands of duplicate contents i can post those error logs or send to email to help u find where all the doubles are
  207