purl's posterous

« Back to posts
  • Viewed
    times
August 30, 2011

get unique content from duplicated *.tar.gz contents

  • Edit
  • Delete
  • Tags
  • Autopost

#!/usr/bin/perl -wln

$fn = $_;

do{ @F=split; $uniq{join qq(\t), @F[3,4,2]}||= join qq(\t), $fn, $F[5] } for grep {!m{^d}}  map{ m{(.*)} } qx(zcat $_ | tar -tv);

END{ print join qq(\t), $_, $uniq{$_} for sort keys %uniq }

Tweet
  • 0 responses
  • Like
  • Comment