Sign in to follow this  
Followers 0
efa

KOI8-R charset. Add Unicode conversion for URL

6 posts in this topic

The "Resolving link obfuscation" engine, do not recognize "KOI8-R" charset for URL:

http://дешевокупи.рф/

and so decode it as:

Decimal ampersand decode: http://45H52>:C?8.[at]D/

Here some tracking url:

http://www.spamcop.net/sc?id=z5017764631zf...6c7c44dde27becz

http://www.spamcop.net/sc?id=z5017946718z5...78f7e772a77bd8z

http://www.spamcop.net/sc?id=z5017947867zb...bbf188ddeef938z

Edited by efa

Share this post


Link to post
Share on other sites

Not sure if conversion would help with those ones. Previous discussions on links in Cyrillic/KOI8-R characters noted a high incidence of "gaming" with DNS records. Looks like nothing has changed over all the years. What I see is:

"DeshevoKupi.rf moved to another domain OlaKupi.ru"

(which bizarrely came out of Google translate with no actual link specified). There is a deshevokupi.ru domain behind an active server too. Seems like they play by different rules.

Anyway do you want to add a request for parser development in the New Feature Requests? If so, will move this topic there but let's leave it where it is for a little longer should others be able to contribute more to the reporting help aspect.

Share this post


Link to post
Share on other sites
Not sure if conversion would help with those ones.

xComplaint V.0.12.26e (bash scri_pt) implement the conversion to UTF-8 and then (on Linux) all worked well.

This is the code:

   # xComplaint released as GNU GPL v3
   # $s is the index for spam.txt. headers.txt is the mail head only.
   # Test is conducted for UTF-8 support
   charset=`grep "charset=" headers.txt`
   charset=`echo $charset | awk -F\" '{ print $2 }'`
   if (test "$debug" = 1) then { echo Original charset: "$charset"; } fi
   if (test "$UTF8" = "1") then   # this OS is UTF-8 capable?
	  if (test "$debug" = 1) then { echo UTF-8 capable OS; } fi
	  iconv -t UTF-8 spam"$s".txt > /dev/null 2>&1   # test if already in UTF-8
	  if (test $? = "1" ) then   # need a conversion to UTF-8 ...
		 if (test "$debug" = 1) then { echo Converting to UTF-8 ...; } fi
		 iconv -f $charset -t UTF-8 spam"$s".txt > temp.txt
		 mv temp.txt spam"$s".txt
	  fi
   fi

Share this post


Link to post
Share on other sites

Thanks efa - your skill easily exceeds mine. But the point is the target (DeshevoKupi.rf) has moved or is somehow obscured. SpamCop would not have found an address for the host even if the URI had been properly converted. In other instances going back years that seems to be the same story. But maybe not all the time.

A problem is that spamvertized sites are not (never have been) a priority for SC.

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!


Register a new account

Sign in

Already have an account? Sign in here.


Sign In Now
Sign in to follow this  
Followers 0