E:\!CORPORA\CxG-Background-Corpus\!Frontiers>python classify_unmask.py Starting unigrams and cc (28000, 30000) 28000 {'C': 0.001, 'loss': 'hinge'} STARTING UNMASKING ROUND 1 eng cc unigrams 1 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 1.00 0.99 0.99 5000 ch 1.00 0.99 1.00 2688 gb 1.00 1.00 1.00 5000 ie 1.00 1.00 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 1.00 1.00 1.00 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 1.00 1.00 3788 us 0.99 1.00 0.99 5000 za 1.00 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 30000) (66476, 30000) (322587, 29980) (66476, 29980) STARTING UNMASKING ROUND 2 eng cc unigrams 2 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.99 0.99 5000 ch 1.00 1.00 1.00 2688 gb 0.99 1.00 1.00 5000 ie 1.00 1.00 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 1.00 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 1.00 0.99 5000 za 1.00 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29980) (66476, 29980) (322587, 29959) (66476, 29959) STARTING UNMASKING ROUND 3 eng cc unigrams 3 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.99 0.99 5000 ch 1.00 1.00 1.00 2688 gb 0.99 1.00 1.00 5000 ie 1.00 0.99 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 1.00 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29959) (66476, 29959) (322587, 29935) (66476, 29935) STARTING UNMASKING ROUND 4 eng cc unigrams 4 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.99 0.99 5000 ch 1.00 0.99 1.00 2688 gb 0.99 1.00 1.00 5000 ie 1.00 0.99 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 1.00 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29935) (66476, 29935) (322587, 29909) (66476, 29909) STARTING UNMASKING ROUND 5 eng cc unigrams 5 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.99 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 1.00 0.99 5000 ie 1.00 0.99 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 1.00 1.00 3788 us 0.99 1.00 0.99 5000 za 0.99 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29909) (66476, 29909) (322587, 29885) (66476, 29885) STARTING UNMASKING ROUND 6 eng cc unigrams 6 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 1.00 0.99 5000 ie 1.00 0.99 1.00 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 1.00 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29885) (66476, 29885) (322587, 29859) (66476, 29859) STARTING UNMASKING ROUND 7 eng cc unigrams 7 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.99 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 1.00 0.99 5000 ie 1.00 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 1.00 1.00 1.00 66476 Reducing feature vectors. (322587, 29859) (66476, 29859) (322587, 29831) (66476, 29831) STARTING UNMASKING ROUND 8 eng cc unigrams 8 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 1.00 0.99 5000 ie 1.00 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 1.00 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29831) (66476, 29831) (322587, 29805) (66476, 29805) STARTING UNMASKING ROUND 9 eng cc unigrams 9 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 1.00 0.99 5000 ie 1.00 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29805) (66476, 29805) (322587, 29779) (66476, 29779) STARTING UNMASKING ROUND 10 eng cc unigrams 10 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29779) (66476, 29779) (322587, 29752) (66476, 29752) STARTING UNMASKING ROUND 11 eng cc unigrams 11 precision recall f1-score support au 0.99 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 1.00 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29752) (66476, 29752) (322587, 29726) (66476, 29726) STARTING UNMASKING ROUND 12 eng cc unigrams 12 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 1.00 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 0.99 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29726) (66476, 29726) (322587, 29698) (66476, 29698) STARTING UNMASKING ROUND 13 eng cc unigrams 13 precision recall f1-score support au 0.99 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 0.99 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29698) (66476, 29698) (322587, 29670) (66476, 29670) STARTING UNMASKING ROUND 14 eng cc unigrams 14 precision recall f1-score support au 0.99 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 0.99 5000 my 1.00 0.99 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.99 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29670) (66476, 29670) (322587, 29644) (66476, 29644) STARTING UNMASKING ROUND 15 eng cc unigrams 15 precision recall f1-score support au 1.00 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 1.00 0.99 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.98 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29644) (66476, 29644) (322587, 29618) (66476, 29618) STARTING UNMASKING ROUND 16 eng cc unigrams 16 precision recall f1-score support au 0.99 1.00 1.00 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 1.00 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29618) (66476, 29618) (322587, 29591) (66476, 29591) STARTING UNMASKING ROUND 17 eng cc unigrams 17 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 1.00 5000 my 1.00 0.99 1.00 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.98 0.99 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29591) (66476, 29591) (322587, 29564) (66476, 29564) STARTING UNMASKING ROUND 18 eng cc unigrams 18 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.99 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.99 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29564) (66476, 29564) (322587, 29536) (66476, 29536) STARTING UNMASKING ROUND 19 eng cc unigrams 19 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 1.00 1.00 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.98 0.98 5000 ph 1.00 0.99 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.98 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29536) (66476, 29536) (322587, 29509) (66476, 29509) STARTING UNMASKING ROUND 20 eng cc unigrams 20 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 1.00 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.99 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.99 0.99 0.99 5000 za 0.98 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29509) (66476, 29509) (322587, 29482) (66476, 29482) STARTING UNMASKING ROUND 21 eng cc unigrams 21 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 1.00 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29482) (66476, 29482) (322587, 29454) (66476, 29454) STARTING UNMASKING ROUND 22 eng cc unigrams 22 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 1.00 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 1.00 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29454) (66476, 29454) (322587, 29427) (66476, 29427) STARTING UNMASKING ROUND 23 eng cc unigrams 23 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29427) (66476, 29427) (322587, 29401) (66476, 29401) STARTING UNMASKING ROUND 24 eng cc unigrams 24 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 1.00 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29401) (66476, 29401) (322587, 29375) (66476, 29375) STARTING UNMASKING ROUND 25 eng cc unigrams 25 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 1.00 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 1.00 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29375) (66476, 29375) (322587, 29350) (66476, 29350) STARTING UNMASKING ROUND 26 eng cc unigrams 26 precision recall f1-score support au 0.99 1.00 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.98 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29350) (66476, 29350) (322587, 29329) (66476, 29329) STARTING UNMASKING ROUND 27 eng cc unigrams 27 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29329) (66476, 29329) (322587, 29306) (66476, 29306) STARTING UNMASKING ROUND 28 eng cc unigrams 28 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29306) (66476, 29306) (322587, 29281) (66476, 29281) STARTING UNMASKING ROUND 29 eng cc unigrams 29 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29281) (66476, 29281) (322587, 29254) (66476, 29254) STARTING UNMASKING ROUND 30 eng cc unigrams 30 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29254) (66476, 29254) (322587, 29226) (66476, 29226) STARTING UNMASKING ROUND 31 eng cc unigrams 31 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29226) (66476, 29226) (322587, 29200) (66476, 29200) STARTING UNMASKING ROUND 32 eng cc unigrams 32 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29200) (66476, 29200) (322587, 29173) (66476, 29173) STARTING UNMASKING ROUND 33 eng cc unigrams 33 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.99 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 1.00 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29173) (66476, 29173) (322587, 29146) (66476, 29146) STARTING UNMASKING ROUND 34 eng cc unigrams 34 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.98 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.99 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29146) (66476, 29146) (322587, 29119) (66476, 29119) STARTING UNMASKING ROUND 35 eng cc unigrams 35 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.99 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29119) (66476, 29119) (322587, 29091) (66476, 29091) STARTING UNMASKING ROUND 36 eng cc unigrams 36 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.98 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29091) (66476, 29091) (322587, 29063) (66476, 29063) STARTING UNMASKING ROUND 37 eng cc unigrams 37 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29063) (66476, 29063) (322587, 29035) (66476, 29035) STARTING UNMASKING ROUND 38 eng cc unigrams 38 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.99 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.99 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29035) (66476, 29035) (322587, 29007) (66476, 29007) STARTING UNMASKING ROUND 39 eng cc unigrams 39 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.99 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.99 5000 za 0.98 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 29007) (66476, 29007) (322587, 28981) (66476, 28981) STARTING UNMASKING ROUND 40 eng cc unigrams 40 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.98 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28981) (66476, 28981) (322587, 28953) (66476, 28953) STARTING UNMASKING ROUND 41 eng cc unigrams 41 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.99 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 1.00 3788 us 0.98 0.99 0.98 5000 za 0.98 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28953) (66476, 28953) (322587, 28926) (66476, 28926) STARTING UNMASKING ROUND 42 eng cc unigrams 42 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.98 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28926) (66476, 28926) (322587, 28899) (66476, 28899) STARTING UNMASKING ROUND 43 eng cc unigrams 43 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28899) (66476, 28899) (322587, 28872) (66476, 28872) STARTING UNMASKING ROUND 44 eng cc unigrams 44 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28872) (66476, 28872) (322587, 28844) (66476, 28844) STARTING UNMASKING ROUND 45 eng cc unigrams 45 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28844) (66476, 28844) (322587, 28816) (66476, 28816) STARTING UNMASKING ROUND 46 eng cc unigrams 46 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28816) (66476, 28816) (322587, 28788) (66476, 28788) STARTING UNMASKING ROUND 47 eng cc unigrams 47 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.99 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.99 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28788) (66476, 28788) (322587, 28760) (66476, 28760) STARTING UNMASKING ROUND 48 eng cc unigrams 48 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28760) (66476, 28760) (322587, 28732) (66476, 28732) STARTING UNMASKING ROUND 49 eng cc unigrams 49 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.97 0.97 5000 ph 1.00 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28732) (66476, 28732) (322587, 28704) (66476, 28704) STARTING UNMASKING ROUND 50 eng cc unigrams 50 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28704) (66476, 28704) (322587, 28676) (66476, 28676) STARTING UNMASKING ROUND 51 eng cc unigrams 51 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28676) (66476, 28676) (322587, 28648) (66476, 28648) STARTING UNMASKING ROUND 52 eng cc unigrams 52 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28648) (66476, 28648) (322587, 28620) (66476, 28620) STARTING UNMASKING ROUND 53 eng cc unigrams 53 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.99 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28620) (66476, 28620) (322587, 28592) (66476, 28592) STARTING UNMASKING ROUND 54 eng cc unigrams 54 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.97 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28592) (66476, 28592) (322587, 28566) (66476, 28566) STARTING UNMASKING ROUND 55 eng cc unigrams 55 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28566) (66476, 28566) (322587, 28539) (66476, 28539) STARTING UNMASKING ROUND 56 eng cc unigrams 56 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.98 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28539) (66476, 28539) (322587, 28511) (66476, 28511) STARTING UNMASKING ROUND 57 eng cc unigrams 57 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28511) (66476, 28511) (322587, 28485) (66476, 28485) STARTING UNMASKING ROUND 58 eng cc unigrams 58 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 1.00 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.99 0.99 0.99 66476 Reducing feature vectors. (322587, 28485) (66476, 28485) (322587, 28459) (66476, 28459) STARTING UNMASKING ROUND 59 eng cc unigrams 59 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.98 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28459) (66476, 28459) (322587, 28431) (66476, 28431) STARTING UNMASKING ROUND 60 eng cc unigrams 60 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28431) (66476, 28431) (322587, 28403) (66476, 28403) STARTING UNMASKING ROUND 61 eng cc unigrams 61 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28403) (66476, 28403) (322587, 28375) (66476, 28375) STARTING UNMASKING ROUND 62 eng cc unigrams 62 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28375) (66476, 28375) (322587, 28347) (66476, 28347) STARTING UNMASKING ROUND 63 eng cc unigrams 63 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.97 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28347) (66476, 28347) (322587, 28319) (66476, 28319) STARTING UNMASKING ROUND 64 eng cc unigrams 64 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.97 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28319) (66476, 28319) (322587, 28292) (66476, 28292) STARTING UNMASKING ROUND 65 eng cc unigrams 65 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.98 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.99 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28292) (66476, 28292) (322587, 28264) (66476, 28264) STARTING UNMASKING ROUND 66 eng cc unigrams 66 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.98 0.98 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.99 0.98 0.99 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.99 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28264) (66476, 28264) (322587, 28236) (66476, 28236) STARTING UNMASKING ROUND 67 eng cc unigrams 67 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.98 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.97 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.98 0.98 5000 za 0.97 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28236) (66476, 28236) (322587, 28208) (66476, 28208) STARTING UNMASKING ROUND 68 eng cc unigrams 68 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.98 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.99 0.98 5000 ng 1.00 1.00 1.00 5000 nz 0.97 0.96 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.99 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28208) (66476, 28208) (322587, 28180) (66476, 28180) STARTING UNMASKING ROUND 69 eng cc unigrams 69 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.98 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.99 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.98 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28180) (66476, 28180) (322587, 28152) (66476, 28152) STARTING UNMASKING ROUND 70 eng cc unigrams 70 precision recall f1-score support au 0.99 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.98 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.96 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28152) (66476, 28152) (322587, 28124) (66476, 28124) STARTING UNMASKING ROUND 71 eng cc unigrams 71 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28124) (66476, 28124) (322587, 28096) (66476, 28096) STARTING UNMASKING ROUND 72 eng cc unigrams 72 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28096) (66476, 28096) (322587, 28068) (66476, 28068) STARTING UNMASKING ROUND 73 eng cc unigrams 73 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.98 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28068) (66476, 28068) (322587, 28040) (66476, 28040) STARTING UNMASKING ROUND 74 eng cc unigrams 74 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28040) (66476, 28040) (322587, 28012) (66476, 28012) STARTING UNMASKING ROUND 75 eng cc unigrams 75 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 28012) (66476, 28012) (322587, 27984) (66476, 27984) STARTING UNMASKING ROUND 76 eng cc unigrams 76 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.99 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27984) (66476, 27984) (322587, 27956) (66476, 27956) STARTING UNMASKING ROUND 77 eng cc unigrams 77 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.98 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 1.00 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27956) (66476, 27956) (322587, 27928) (66476, 27928) STARTING UNMASKING ROUND 78 eng cc unigrams 78 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27928) (66476, 27928) (322587, 27900) (66476, 27900) STARTING UNMASKING ROUND 79 eng cc unigrams 79 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27900) (66476, 27900) (322587, 27872) (66476, 27872) STARTING UNMASKING ROUND 80 eng cc unigrams 80 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27872) (66476, 27872) (322587, 27846) (66476, 27846) STARTING UNMASKING ROUND 81 eng cc unigrams 81 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.99 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27846) (66476, 27846) (322587, 27818) (66476, 27818) STARTING UNMASKING ROUND 82 eng cc unigrams 82 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.98 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27818) (66476, 27818) (322587, 27790) (66476, 27790) STARTING UNMASKING ROUND 83 eng cc unigrams 83 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.99 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27790) (66476, 27790) (322587, 27762) (66476, 27762) STARTING UNMASKING ROUND 84 eng cc unigrams 84 precision recall f1-score support au 0.98 0.99 0.99 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.98 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27762) (66476, 27762) (322587, 27734) (66476, 27734) STARTING UNMASKING ROUND 85 eng cc unigrams 85 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.98 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.97 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27734) (66476, 27734) (322587, 27706) (66476, 27706) STARTING UNMASKING ROUND 86 eng cc unigrams 86 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.97 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27706) (66476, 27706) (322587, 27678) (66476, 27678) STARTING UNMASKING ROUND 87 eng cc unigrams 87 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27678) (66476, 27678) (322587, 27651) (66476, 27651) STARTING UNMASKING ROUND 88 eng cc unigrams 88 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.96 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27651) (66476, 27651) (322587, 27623) (66476, 27623) STARTING UNMASKING ROUND 89 eng cc unigrams 89 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.96 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27623) (66476, 27623) (322587, 27595) (66476, 27595) STARTING UNMASKING ROUND 90 eng cc unigrams 90 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.97 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.97 0.97 0.97 5000 ie 0.98 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.99 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27595) (66476, 27595) (322587, 27567) (66476, 27567) STARTING UNMASKING ROUND 91 eng cc unigrams 91 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.98 0.96 0.97 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.95 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.98 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27567) (66476, 27567) (322587, 27539) (66476, 27539) STARTING UNMASKING ROUND 92 eng cc unigrams 92 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.96 0.96 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27539) (66476, 27539) (322587, 27511) (66476, 27511) STARTING UNMASKING ROUND 93 eng cc unigrams 93 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.97 0.96 0.96 5000 ch 0.99 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27511) (66476, 27511) (322587, 27483) (66476, 27483) STARTING UNMASKING ROUND 94 eng cc unigrams 94 precision recall f1-score support au 0.98 0.99 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27483) (66476, 27483) (322587, 27455) (66476, 27455) STARTING UNMASKING ROUND 95 eng cc unigrams 95 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.96 0.96 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27455) (66476, 27455) (322587, 27427) (66476, 27427) STARTING UNMASKING ROUND 96 eng cc unigrams 96 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.99 5000 my 0.98 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27427) (66476, 27427) (322587, 27400) (66476, 27400) STARTING UNMASKING ROUND 97 eng cc unigrams 97 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.98 0.95 0.96 5000 ch 0.98 0.97 0.98 2688 gb 0.96 0.97 0.97 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.99 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27400) (66476, 27400) (322587, 27372) (66476, 27372) STARTING UNMASKING ROUND 98 eng cc unigrams 98 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.99 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27372) (66476, 27372) (322587, 27344) (66476, 27344) STARTING UNMASKING ROUND 99 eng cc unigrams 99 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27344) (66476, 27344) (322587, 27316) (66476, 27316) STARTING UNMASKING ROUND 100 eng cc unigrams 100 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27316) (66476, 27316) (322587, 27288) (66476, 27288) STARTING UNMASKING ROUND 101 eng cc unigrams 101 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27288) (66476, 27288) (322587, 27260) (66476, 27260) STARTING UNMASKING ROUND 102 eng cc unigrams 102 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.98 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27260) (66476, 27260) (322587, 27232) (66476, 27232) STARTING UNMASKING ROUND 103 eng cc unigrams 103 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.99 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27232) (66476, 27232) (322587, 27204) (66476, 27204) STARTING UNMASKING ROUND 104 eng cc unigrams 104 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.99 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.97 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27204) (66476, 27204) (322587, 27176) (66476, 27176) STARTING UNMASKING ROUND 105 eng cc unigrams 105 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.97 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.99 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.98 0.98 0.98 66476 Reducing feature vectors. (322587, 27176) (66476, 27176) (322587, 27148) (66476, 27148) STARTING UNMASKING ROUND 106 eng cc unigrams 106 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.97 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27148) (66476, 27148) (322587, 27121) (66476, 27121) STARTING UNMASKING ROUND 107 eng cc unigrams 107 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27121) (66476, 27121) (322587, 27093) (66476, 27093) STARTING UNMASKING ROUND 108 eng cc unigrams 108 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.96 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27093) (66476, 27093) (322587, 27065) (66476, 27065) STARTING UNMASKING ROUND 109 eng cc unigrams 109 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.95 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27065) (66476, 27065) (322587, 27037) (66476, 27037) STARTING UNMASKING ROUND 110 eng cc unigrams 110 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.95 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27037) (66476, 27037) (322587, 27009) (66476, 27009) STARTING UNMASKING ROUND 111 eng cc unigrams 111 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 1.00 0.99 5000 nz 0.95 0.94 0.95 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 27009) (66476, 27009) (322587, 26981) (66476, 26981) STARTING UNMASKING ROUND 112 eng cc unigrams 112 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.97 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.94 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26981) (66476, 26981) (322587, 26954) (66476, 26954) STARTING UNMASKING ROUND 113 eng cc unigrams 113 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.98 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.94 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.99 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.96 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26954) (66476, 26954) (322587, 26926) (66476, 26926) STARTING UNMASKING ROUND 114 eng cc unigrams 114 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.97 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.94 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26926) (66476, 26926) (322587, 26898) (66476, 26898) STARTING UNMASKING ROUND 115 eng cc unigrams 115 precision recall f1-score support au 0.98 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.94 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.98 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26898) (66476, 26898) (322587, 26870) (66476, 26870) STARTING UNMASKING ROUND 116 eng cc unigrams 116 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26870) (66476, 26870) (322587, 26842) (66476, 26842) STARTING UNMASKING ROUND 117 eng cc unigrams 117 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.95 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26842) (66476, 26842) (322587, 26814) (66476, 26814) STARTING UNMASKING ROUND 118 eng cc unigrams 118 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26814) (66476, 26814) (322587, 26786) (66476, 26786) STARTING UNMASKING ROUND 119 eng cc unigrams 119 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.97 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26786) (66476, 26786) (322587, 26758) (66476, 26758) STARTING UNMASKING ROUND 120 eng cc unigrams 120 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26758) (66476, 26758) (322587, 26730) (66476, 26730) STARTING UNMASKING ROUND 121 eng cc unigrams 121 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.95 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.95 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26730) (66476, 26730) (322587, 26702) (66476, 26702) STARTING UNMASKING ROUND 122 eng cc unigrams 122 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.96 5000 ch 0.98 0.95 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26702) (66476, 26702) (322587, 26674) (66476, 26674) STARTING UNMASKING ROUND 123 eng cc unigrams 123 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.96 5000 ch 0.98 0.96 0.97 2688 gb 0.96 0.96 0.96 5000 ie 0.97 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26674) (66476, 26674) (322587, 26646) (66476, 26646) STARTING UNMASKING ROUND 124 eng cc unigrams 124 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.96 0.97 2688 gb 0.95 0.96 0.96 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26646) (66476, 26646) (322587, 26619) (66476, 26619) STARTING UNMASKING ROUND 125 eng cc unigrams 125 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.96 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.96 0.96 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.95 0.93 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26619) (66476, 26619) (322587, 26591) (66476, 26591) STARTING UNMASKING ROUND 126 eng cc unigrams 126 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.96 0.96 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.97 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.95 0.92 0.94 5000 ph 0.99 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26591) (66476, 26591) (322587, 26564) (66476, 26564) STARTING UNMASKING ROUND 127 eng cc unigrams 127 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.95 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26564) (66476, 26564) (322587, 26536) (66476, 26536) STARTING UNMASKING ROUND 128 eng cc unigrams 128 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26536) (66476, 26536) (322587, 26508) (66476, 26508) STARTING UNMASKING ROUND 129 eng cc unigrams 129 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.94 0.93 0.94 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26508) (66476, 26508) (322587, 26480) (66476, 26480) STARTING UNMASKING ROUND 130 eng cc unigrams 130 precision recall f1-score support au 0.97 0.98 0.98 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.99 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26480) (66476, 26480) (322587, 26453) (66476, 26453) STARTING UNMASKING ROUND 131 eng cc unigrams 131 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.96 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.97 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26453) (66476, 26453) (322587, 26425) (66476, 26425) STARTING UNMASKING ROUND 132 eng cc unigrams 132 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.96 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.95 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26425) (66476, 26425) (322587, 26397) (66476, 26397) STARTING UNMASKING ROUND 133 eng cc unigrams 133 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26397) (66476, 26397) (322587, 26370) (66476, 26370) STARTING UNMASKING ROUND 134 eng cc unigrams 134 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.97 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26370) (66476, 26370) (322587, 26342) (66476, 26342) STARTING UNMASKING ROUND 135 eng cc unigrams 135 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.97 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.95 0.97 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26342) (66476, 26342) (322587, 26314) (66476, 26314) STARTING UNMASKING ROUND 136 eng cc unigrams 136 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.97 0.98 5000 my 0.96 0.97 0.97 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26314) (66476, 26314) (322587, 26286) (66476, 26286) STARTING UNMASKING ROUND 137 eng cc unigrams 137 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.97 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.97 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26286) (66476, 26286) (322587, 26258) (66476, 26258) STARTING UNMASKING ROUND 138 eng cc unigrams 138 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.96 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 0.99 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26258) (66476, 26258) (322587, 26230) (66476, 26230) STARTING UNMASKING ROUND 139 eng cc unigrams 139 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.97 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.95 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.97 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26230) (66476, 26230) (322587, 26202) (66476, 26202) STARTING UNMASKING ROUND 140 eng cc unigrams 140 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.95 0.96 5000 in 0.98 0.98 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.97 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26202) (66476, 26202) (322587, 26174) (66476, 26174) STARTING UNMASKING ROUND 141 eng cc unigrams 141 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.95 0.95 0.95 5000 in 0.98 0.97 0.98 5000 my 0.96 0.97 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.97 0.96 5000 avg / total 0.97 0.96 0.96 66476 Reducing feature vectors. (322587, 26174) (66476, 26174) (322587, 26146) (66476, 26146) STARTING UNMASKING ROUND 142 eng cc unigrams 142 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.94 0.95 0.95 5000 ie 0.96 0.95 0.96 5000 in 0.98 0.97 0.98 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.97 0.96 5000 avg / total 0.97 0.97 0.97 66476 Reducing feature vectors. (322587, 26146) (66476, 26146) (322587, 26120) (66476, 26120) STARTING UNMASKING ROUND 143 eng cc unigrams 143 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.95 0.96 5000 in 0.98 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.98 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.96 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.97 0.96 0.96 66476 Reducing feature vectors. (322587, 26120) (66476, 26120) (322587, 26092) (66476, 26092) STARTING UNMASKING ROUND 144 eng cc unigrams 144 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.95 0.95 5000 ie 0.96 0.95 0.96 5000 in 0.98 0.97 0.98 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.98 0.96 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 26092) (66476, 26092) (322587, 26064) (66476, 26064) STARTING UNMASKING ROUND 145 eng cc unigrams 145 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.97 0.95 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.96 0.95 0.95 5000 in 0.98 0.97 0.98 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.98 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 26064) (66476, 26064) (322587, 26036) (66476, 26036) STARTING UNMASKING ROUND 146 eng cc unigrams 146 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.97 0.95 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.96 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 26036) (66476, 26036) (322587, 26008) (66476, 26008) STARTING UNMASKING ROUND 147 eng cc unigrams 147 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.96 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 26008) (66476, 26008) (322587, 25980) (66476, 25980) STARTING UNMASKING ROUND 148 eng cc unigrams 148 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.94 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.96 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 0.99 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25980) (66476, 25980) (322587, 25953) (66476, 25953) STARTING UNMASKING ROUND 149 eng cc unigrams 149 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 1.00 1.00 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.94 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25953) (66476, 25953) (322587, 25925) (66476, 25925) STARTING UNMASKING ROUND 150 eng cc unigrams 150 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 0.99 1.00 1.00 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25925) (66476, 25925) (322587, 25897) (66476, 25897) STARTING UNMASKING ROUND 151 eng cc unigrams 151 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.94 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.98 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25897) (66476, 25897) (322587, 25869) (66476, 25869) STARTING UNMASKING ROUND 152 eng cc unigrams 152 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.94 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25869) (66476, 25869) (322587, 25841) (66476, 25841) STARTING UNMASKING ROUND 153 eng cc unigrams 153 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.98 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.98 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25841) (66476, 25841) (322587, 25813) (66476, 25813) STARTING UNMASKING ROUND 154 eng cc unigrams 154 precision recall f1-score support au 0.97 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.95 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.98 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.92 0.93 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25813) (66476, 25813) (322587, 25785) (66476, 25785) STARTING UNMASKING ROUND 155 eng cc unigrams 155 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.95 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.93 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25785) (66476, 25785) (322587, 25757) (66476, 25757) STARTING UNMASKING ROUND 156 eng cc unigrams 156 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25757) (66476, 25757) (322587, 25729) (66476, 25729) STARTING UNMASKING ROUND 157 eng cc unigrams 157 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.95 5000 ch 0.98 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25729) (66476, 25729) (322587, 25701) (66476, 25701) STARTING UNMASKING ROUND 158 eng cc unigrams 158 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.98 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25701) (66476, 25701) (322587, 25673) (66476, 25673) STARTING UNMASKING ROUND 159 eng cc unigrams 159 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.98 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25673) (66476, 25673) (322587, 25645) (66476, 25645) STARTING UNMASKING ROUND 160 eng cc unigrams 160 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25645) (66476, 25645) (322587, 25617) (66476, 25617) STARTING UNMASKING ROUND 161 eng cc unigrams 161 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.96 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.94 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25617) (66476, 25617) (322587, 25589) (66476, 25589) STARTING UNMASKING ROUND 162 eng cc unigrams 162 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.97 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25589) (66476, 25589) (322587, 25561) (66476, 25561) STARTING UNMASKING ROUND 163 eng cc unigrams 163 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25561) (66476, 25561) (322587, 25533) (66476, 25533) STARTING UNMASKING ROUND 164 eng cc unigrams 164 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.94 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.98 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25533) (66476, 25533) (322587, 25505) (66476, 25505) STARTING UNMASKING ROUND 165 eng cc unigrams 165 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.93 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.93 0.94 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25505) (66476, 25505) (322587, 25477) (66476, 25477) STARTING UNMASKING ROUND 166 eng cc unigrams 166 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.93 0.93 5000 ie 0.95 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.99 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.95 0.96 0.96 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25477) (66476, 25477) (322587, 25449) (66476, 25449) STARTING UNMASKING ROUND 167 eng cc unigrams 167 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.96 2688 gb 0.94 0.93 0.93 5000 ie 0.94 0.95 0.95 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25449) (66476, 25449) (322587, 25421) (66476, 25421) STARTING UNMASKING ROUND 168 eng cc unigrams 168 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25421) (66476, 25421) (322587, 25394) (66476, 25394) STARTING UNMASKING ROUND 169 eng cc unigrams 169 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25394) (66476, 25394) (322587, 25366) (66476, 25366) STARTING UNMASKING ROUND 170 eng cc unigrams 170 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25366) (66476, 25366) (322587, 25338) (66476, 25338) STARTING UNMASKING ROUND 171 eng cc unigrams 171 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25338) (66476, 25338) (322587, 25310) (66476, 25310) STARTING UNMASKING ROUND 172 eng cc unigrams 172 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25310) (66476, 25310) (322587, 25282) (66476, 25282) STARTING UNMASKING ROUND 173 eng cc unigrams 173 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.94 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.96 0.96 0.96 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25282) (66476, 25282) (322587, 25254) (66476, 25254) STARTING UNMASKING ROUND 174 eng cc unigrams 174 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.96 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25254) (66476, 25254) (322587, 25226) (66476, 25226) STARTING UNMASKING ROUND 175 eng cc unigrams 175 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25226) (66476, 25226) (322587, 25198) (66476, 25198) STARTING UNMASKING ROUND 176 eng cc unigrams 176 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.96 0.92 0.94 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.95 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.96 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.90 0.92 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25198) (66476, 25198) (322587, 25170) (66476, 25170) STARTING UNMASKING ROUND 177 eng cc unigrams 177 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25170) (66476, 25170) (322587, 25142) (66476, 25142) STARTING UNMASKING ROUND 178 eng cc unigrams 178 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.91 0.92 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25142) (66476, 25142) (322587, 25114) (66476, 25114) STARTING UNMASKING ROUND 179 eng cc unigrams 179 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.97 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.93 0.90 0.92 5000 ph 0.98 0.97 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.97 0.95 5000 avg / total 0.96 0.96 0.96 66476 Reducing feature vectors. (322587, 25114) (66476, 25114) (322587, 25086) (66476, 25086) STARTING UNMASKING ROUND 180 eng cc unigrams 180 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.94 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.91 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.96 0.95 5000 avg / total 0.96 0.96 0.95 66476 Reducing feature vectors. (322587, 25086) (66476, 25086) (322587, 25058) (66476, 25058) STARTING UNMASKING ROUND 181 eng cc unigrams 181 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.93 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 25058) (66476, 25058) (322587, 25030) (66476, 25030) STARTING UNMASKING ROUND 182 eng cc unigrams 182 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 25030) (66476, 25030) (322587, 25002) (66476, 25002) STARTING UNMASKING ROUND 183 eng cc unigrams 183 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.94 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 25002) (66476, 25002) (322587, 24974) (66476, 24974) STARTING UNMASKING ROUND 184 eng cc unigrams 184 precision recall f1-score support au 0.96 0.98 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.93 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24974) (66476, 24974) (322587, 24946) (66476, 24946) STARTING UNMASKING ROUND 185 eng cc unigrams 185 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.93 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24946) (66476, 24946) (322587, 24918) (66476, 24918) STARTING UNMASKING ROUND 186 eng cc unigrams 186 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.93 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.95 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24918) (66476, 24918) (322587, 24890) (66476, 24890) STARTING UNMASKING ROUND 187 eng cc unigrams 187 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.93 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.98 0.97 0.97 3788 us 0.94 0.95 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24890) (66476, 24890) (322587, 24862) (66476, 24862) STARTING UNMASKING ROUND 188 eng cc unigrams 188 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.92 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.97 5000 my 0.95 0.95 0.95 5000 ng 0.98 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24862) (66476, 24862) (322587, 24834) (66476, 24834) STARTING UNMASKING ROUND 189 eng cc unigrams 189 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.93 5000 ie 0.94 0.94 0.94 5000 in 0.97 0.96 0.96 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.94 0.96 0.95 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24834) (66476, 24834) (322587, 24806) (66476, 24806) STARTING UNMASKING ROUND 190 eng cc unigrams 190 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.92 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.96 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24806) (66476, 24806) (322587, 24778) (66476, 24778) STARTING UNMASKING ROUND 191 eng cc unigrams 191 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.95 0.92 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.92 5000 ie 0.93 0.94 0.94 5000 in 0.97 0.96 0.96 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.90 0.91 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24778) (66476, 24778) (322587, 24750) (66476, 24750) STARTING UNMASKING ROUND 192 eng cc unigrams 192 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.97 0.94 0.95 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.94 0.93 5000 in 0.97 0.96 0.96 5000 my 0.95 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.91 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24750) (66476, 24750) (322587, 24723) (66476, 24723) STARTING UNMASKING ROUND 193 eng cc unigrams 193 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.94 0.93 5000 in 0.97 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.91 5000 ph 0.98 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24723) (66476, 24723) (322587, 24695) (66476, 24695) STARTING UNMASKING ROUND 194 eng cc unigrams 194 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.94 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.94 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24695) (66476, 24695) (322587, 24668) (66476, 24668) STARTING UNMASKING ROUND 195 eng cc unigrams 195 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24668) (66476, 24668) (322587, 24641) (66476, 24641) STARTING UNMASKING ROUND 196 eng cc unigrams 196 precision recall f1-score support au 0.96 0.97 0.97 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.93 0.95 2688 gb 0.93 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24641) (66476, 24641) (322587, 24613) (66476, 24613) STARTING UNMASKING ROUND 197 eng cc unigrams 197 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.94 0.95 0.94 5000 za 0.92 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24613) (66476, 24613) (322587, 24585) (66476, 24585) STARTING UNMASKING ROUND 198 eng cc unigrams 198 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24585) (66476, 24585) (322587, 24557) (66476, 24557) STARTING UNMASKING ROUND 199 eng cc unigrams 199 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.96 0.93 0.94 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24557) (66476, 24557) (322587, 24529) (66476, 24529) STARTING UNMASKING ROUND 200 eng cc unigrams 200 precision recall f1-score support au 0.96 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24529) (66476, 24529) (322587, 24501) (66476, 24501) STARTING UNMASKING ROUND 201 eng cc unigrams 201 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24501) (66476, 24501) (322587, 24473) (66476, 24473) STARTING UNMASKING ROUND 202 eng cc unigrams 202 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.95 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.92 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.95 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.89 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.97 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24473) (66476, 24473) (322587, 24445) (66476, 24445) STARTING UNMASKING ROUND 203 eng cc unigrams 203 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.91 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.92 0.88 0.90 5000 ph 0.97 0.96 0.97 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.94 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24445) (66476, 24445) (322587, 24417) (66476, 24417) STARTING UNMASKING ROUND 204 eng cc unigrams 204 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.89 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.97 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24417) (66476, 24417) (322587, 24389) (66476, 24389) STARTING UNMASKING ROUND 205 eng cc unigrams 205 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.92 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.89 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24389) (66476, 24389) (322587, 24361) (66476, 24361) STARTING UNMASKING ROUND 206 eng cc unigrams 206 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.95 0.94 5000 za 0.91 0.95 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24361) (66476, 24361) (322587, 24333) (66476, 24333) STARTING UNMASKING ROUND 207 eng cc unigrams 207 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.93 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.95 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.95 66476 Reducing feature vectors. (322587, 24333) (66476, 24333) (322587, 24305) (66476, 24305) STARTING UNMASKING ROUND 208 eng cc unigrams 208 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.92 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.95 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.94 0.94 5000 za 0.91 0.96 0.93 5000 avg / total 0.95 0.95 0.94 66476 Reducing feature vectors. (322587, 24305) (66476, 24305) (322587, 24278) (66476, 24278) STARTING UNMASKING ROUND 209 eng cc unigrams 209 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.97 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.90 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.95 0.94 5000 za 0.91 0.95 0.93 5000 avg / total 0.95 0.94 0.94 66476 Reducing feature vectors. (322587, 24278) (66476, 24278) (322587, 24250) (66476, 24250) STARTING UNMASKING ROUND 210 eng cc unigrams 210 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.92 0.95 0.94 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24250) (66476, 24250) (322587, 24222) (66476, 24222) STARTING UNMASKING ROUND 211 eng cc unigrams 211 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.92 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24222) (66476, 24222) (322587, 24195) (66476, 24195) STARTING UNMASKING ROUND 212 eng cc unigrams 212 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.92 0.94 2688 gb 0.92 0.91 0.91 5000 ie 0.93 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.93 0.95 0.94 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24195) (66476, 24195) (322587, 24167) (66476, 24167) STARTING UNMASKING ROUND 213 eng cc unigrams 213 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.91 0.91 5000 ie 0.92 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.93 0.94 0.93 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24167) (66476, 24167) (322587, 24139) (66476, 24139) STARTING UNMASKING ROUND 214 eng cc unigrams 214 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.91 0.91 5000 ie 0.92 0.93 0.93 5000 in 0.96 0.96 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.96 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24139) (66476, 24139) (322587, 24112) (66476, 24112) STARTING UNMASKING ROUND 215 eng cc unigrams 215 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.94 0.91 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.90 0.91 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24112) (66476, 24112) (322587, 24084) (66476, 24084) STARTING UNMASKING ROUND 216 eng cc unigrams 216 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.91 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.90 0.91 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.97 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.91 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24084) (66476, 24084) (322587, 24056) (66476, 24056) STARTING UNMASKING ROUND 217 eng cc unigrams 217 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.91 5000 ie 0.92 0.93 0.93 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.90 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24056) (66476, 24056) (322587, 24028) (66476, 24028) STARTING UNMASKING ROUND 218 eng cc unigrams 218 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.91 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24028) (66476, 24028) (322587, 24000) (66476, 24000) STARTING UNMASKING ROUND 219 eng cc unigrams 219 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.91 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 24000) (66476, 24000) (322587, 23972) (66476, 23972) STARTING UNMASKING ROUND 220 eng cc unigrams 220 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.97 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23972) (66476, 23972) (322587, 23945) (66476, 23945) STARTING UNMASKING ROUND 221 eng cc unigrams 221 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.88 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23945) (66476, 23945) (322587, 23917) (66476, 23917) STARTING UNMASKING ROUND 222 eng cc unigrams 222 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.94 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.93 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23917) (66476, 23917) (322587, 23889) (66476, 23889) STARTING UNMASKING ROUND 223 eng cc unigrams 223 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23889) (66476, 23889) (322587, 23861) (66476, 23861) STARTING UNMASKING ROUND 224 eng cc unigrams 224 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.96 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.97 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23861) (66476, 23861) (322587, 23833) (66476, 23833) STARTING UNMASKING ROUND 225 eng cc unigrams 225 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.91 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23833) (66476, 23833) (322587, 23805) (66476, 23805) STARTING UNMASKING ROUND 226 eng cc unigrams 226 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.90 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23805) (66476, 23805) (322587, 23777) (66476, 23777) STARTING UNMASKING ROUND 227 eng cc unigrams 227 precision recall f1-score support au 0.95 0.97 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.91 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.98 5000 nz 0.90 0.87 0.89 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23777) (66476, 23777) (322587, 23749) (66476, 23749) STARTING UNMASKING ROUND 228 eng cc unigrams 228 precision recall f1-score support au 0.95 0.96 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.92 0.92 5000 in 0.96 0.95 0.95 5000 my 0.94 0.94 0.94 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.90 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23749) (66476, 23749) (322587, 23721) (66476, 23721) STARTING UNMASKING ROUND 229 eng cc unigrams 229 precision recall f1-score support au 0.95 0.96 0.96 5000 ca 0.93 0.90 0.92 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.90 0.90 5000 ie 0.92 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.94 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23721) (66476, 23721) (322587, 23693) (66476, 23693) STARTING UNMASKING ROUND 230 eng cc unigrams 230 precision recall f1-score support au 0.95 0.96 0.96 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.89 0.90 5000 ie 0.92 0.92 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.89 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23693) (66476, 23693) (322587, 23666) (66476, 23666) STARTING UNMASKING ROUND 231 eng cc unigrams 231 precision recall f1-score support au 0.95 0.96 0.96 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.89 0.90 5000 ie 0.91 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.89 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23666) (66476, 23666) (322587, 23638) (66476, 23638) STARTING UNMASKING ROUND 232 eng cc unigrams 232 precision recall f1-score support au 0.95 0.96 0.95 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.89 0.90 5000 ie 0.91 0.93 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23638) (66476, 23638) (322587, 23610) (66476, 23610) STARTING UNMASKING ROUND 233 eng cc unigrams 233 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.91 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.97 0.95 0.96 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23610) (66476, 23610) (322587, 23582) (66476, 23582) STARTING UNMASKING ROUND 234 eng cc unigrams 234 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.89 0.93 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.97 0.95 0.96 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.92 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23582) (66476, 23582) (322587, 23554) (66476, 23554) STARTING UNMASKING ROUND 235 eng cc unigrams 235 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.91 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.96 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.93 5000 za 0.89 0.95 0.92 5000 avg / total 0.94 0.94 0.94 66476 Reducing feature vectors. (322587, 23554) (66476, 23554) (322587, 23526) (66476, 23526) STARTING UNMASKING ROUND 236 eng cc unigrams 236 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.93 0.90 0.91 5000 ch 0.96 0.89 0.93 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.97 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.96 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23526) (66476, 23526) (322587, 23498) (66476, 23498) STARTING UNMASKING ROUND 237 eng cc unigrams 237 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.90 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23498) (66476, 23498) (322587, 23470) (66476, 23470) STARTING UNMASKING ROUND 238 eng cc unigrams 238 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.90 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.94 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23470) (66476, 23470) (322587, 23442) (66476, 23442) STARTING UNMASKING ROUND 239 eng cc unigrams 239 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.96 0.90 0.93 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23442) (66476, 23442) (322587, 23414) (66476, 23414) STARTING UNMASKING ROUND 240 eng cc unigrams 240 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.90 0.88 0.89 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23414) (66476, 23414) (322587, 23386) (66476, 23386) STARTING UNMASKING ROUND 241 eng cc unigrams 241 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.90 0.89 0.89 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.95 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.96 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23386) (66476, 23386) (322587, 23358) (66476, 23358) STARTING UNMASKING ROUND 242 eng cc unigrams 242 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.90 0.89 0.90 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.99 0.97 5000 nz 0.90 0.87 0.88 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23358) (66476, 23358) (322587, 23331) (66476, 23331) STARTING UNMASKING ROUND 243 eng cc unigrams 243 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.95 0.89 0.92 2688 gb 0.90 0.89 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.94 0.92 5000 za 0.89 0.95 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23331) (66476, 23331) (322587, 23303) (66476, 23303) STARTING UNMASKING ROUND 244 eng cc unigrams 244 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.95 0.89 0.92 2688 gb 0.90 0.88 0.89 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.94 0.92 5000 za 0.89 0.94 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23303) (66476, 23303) (322587, 23275) (66476, 23275) STARTING UNMASKING ROUND 245 eng cc unigrams 245 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.95 0.89 0.92 2688 gb 0.90 0.89 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.94 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.94 0.92 5000 za 0.89 0.94 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23275) (66476, 23275) (322587, 23247) (66476, 23247) STARTING UNMASKING ROUND 246 eng cc unigrams 246 precision recall f1-score support au 0.94 0.96 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.95 0.89 0.92 2688 gb 0.90 0.88 0.89 5000 ie 0.91 0.92 0.92 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.90 0.86 0.88 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.94 0.92 5000 za 0.89 0.94 0.92 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23247) (66476, 23247) (322587, 23220) (66476, 23220) STARTING UNMASKING ROUND 247 eng cc unigrams 247 precision recall f1-score support au 0.94 0.95 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.96 0.89 0.92 2688 gb 0.90 0.88 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.96 0.95 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.96 3788 us 0.91 0.93 0.92 5000 za 0.89 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23220) (66476, 23220) (322587, 23192) (66476, 23192) STARTING UNMASKING ROUND 248 eng cc unigrams 248 precision recall f1-score support au 0.94 0.95 0.95 5000 ca 0.92 0.89 0.91 5000 ch 0.95 0.89 0.92 2688 gb 0.90 0.88 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.86 0.87 5000 ph 0.96 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23192) (66476, 23192) (322587, 23164) (66476, 23164) STARTING UNMASKING ROUND 249 eng cc unigrams 249 precision recall f1-score support au 0.94 0.95 0.95 5000 ca 0.92 0.89 0.90 5000 ch 0.95 0.89 0.92 2688 gb 0.89 0.88 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.93 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.86 0.87 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23164) (66476, 23164) (322587, 23136) (66476, 23136) STARTING UNMASKING ROUND 250 eng cc unigrams 250 precision recall f1-score support au 0.94 0.95 0.94 5000 ca 0.92 0.89 0.90 5000 ch 0.95 0.89 0.92 2688 gb 0.89 0.88 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.96 0.95 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23136) (66476, 23136) (322587, 23108) (66476, 23108) STARTING UNMASKING ROUND 251 eng cc unigrams 251 precision recall f1-score support au 0.94 0.95 0.95 5000 ca 0.91 0.89 0.90 5000 ch 0.95 0.89 0.92 2688 gb 0.89 0.88 0.89 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.93 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.96 0.94 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23108) (66476, 23108) (322587, 23080) (66476, 23080) STARTING UNMASKING ROUND 252 eng cc unigrams 252 precision recall f1-score support au 0.94 0.95 0.95 5000 ca 0.91 0.89 0.90 5000 ch 0.95 0.89 0.92 2688 gb 0.89 0.88 0.88 5000 ie 0.90 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.93 0.93 0.93 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.96 0.95 0.95 5000 pk 0.99 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.94 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23080) (66476, 23080) (322587, 23052) (66476, 23052) STARTING UNMASKING ROUND 253 eng cc unigrams 253 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.89 0.90 5000 ch 0.95 0.89 0.92 2688 gb 0.89 0.88 0.88 5000 ie 0.91 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.95 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23052) (66476, 23052) (322587, 23024) (66476, 23024) STARTING UNMASKING ROUND 254 eng cc unigrams 254 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.89 0.90 5000 ch 0.95 0.88 0.92 2688 gb 0.89 0.88 0.88 5000 ie 0.90 0.92 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.94 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 23024) (66476, 23024) (322587, 22996) (66476, 22996) STARTING UNMASKING ROUND 255 eng cc unigrams 255 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.88 0.90 5000 ch 0.95 0.88 0.91 2688 gb 0.89 0.88 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.93 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 22996) (66476, 22996) (322587, 22968) (66476, 22968) STARTING UNMASKING ROUND 256 eng cc unigrams 256 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.89 0.90 5000 ch 0.95 0.88 0.91 2688 gb 0.89 0.88 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.95 5000 my 0.93 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.91 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.93 0.93 0.93 66476 Reducing feature vectors. (322587, 22968) (66476, 22968) (322587, 22940) (66476, 22940) STARTING UNMASKING ROUND 257 eng cc unigrams 257 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.88 0.90 5000 ch 0.95 0.88 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22940) (66476, 22940) (322587, 22912) (66476, 22912) STARTING UNMASKING ROUND 258 eng cc unigrams 258 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.88 0.90 5000 ch 0.94 0.87 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22912) (66476, 22912) (322587, 22884) (66476, 22884) STARTING UNMASKING ROUND 259 eng cc unigrams 259 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.88 0.89 5000 ch 0.94 0.88 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.95 5000 my 0.92 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22884) (66476, 22884) (322587, 22856) (66476, 22856) STARTING UNMASKING ROUND 260 eng cc unigrams 260 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.91 0.88 0.89 5000 ch 0.94 0.87 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.94 5000 my 0.92 0.92 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.91 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22856) (66476, 22856) (322587, 22828) (66476, 22828) STARTING UNMASKING ROUND 261 eng cc unigrams 261 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.91 5000 in 0.95 0.94 0.94 5000 my 0.92 0.92 0.92 5000 ng 0.96 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22828) (66476, 22828) (322587, 22801) (66476, 22801) STARTING UNMASKING ROUND 262 eng cc unigrams 262 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.91 2688 gb 0.89 0.87 0.88 5000 ie 0.90 0.91 0.90 5000 in 0.95 0.94 0.94 5000 my 0.92 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.85 0.87 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.92 5000 za 0.88 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22801) (66476, 22801) (322587, 22773) (66476, 22773) STARTING UNMASKING ROUND 263 eng cc unigrams 263 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.87 0.88 5000 ie 0.90 0.91 0.90 5000 in 0.95 0.94 0.94 5000 my 0.92 0.92 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.89 0.84 0.86 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.99 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.91 5000 za 0.88 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22773) (66476, 22773) (322587, 22745) (66476, 22745) STARTING UNMASKING ROUND 264 eng cc unigrams 264 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.95 0.87 0.91 2688 gb 0.88 0.87 0.88 5000 ie 0.90 0.91 0.90 5000 in 0.95 0.94 0.94 5000 my 0.92 0.91 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.95 0.95 3788 us 0.90 0.93 0.92 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22745) (66476, 22745) (322587, 22717) (66476, 22717) STARTING UNMASKING ROUND 265 eng cc unigrams 265 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.87 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.95 0.95 3788 us 0.90 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22717) (66476, 22717) (322587, 22689) (66476, 22689) STARTING UNMASKING ROUND 266 eng cc unigrams 266 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.91 2688 gb 0.88 0.87 0.88 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.92 5000 ng 0.95 0.98 0.97 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.91 5000 za 0.88 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22689) (66476, 22689) (322587, 22661) (66476, 22661) STARTING UNMASKING ROUND 267 eng cc unigrams 267 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.95 0.87 0.90 2688 gb 0.88 0.87 0.88 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.95 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22661) (66476, 22661) (322587, 22633) (66476, 22633) STARTING UNMASKING ROUND 268 eng cc unigrams 268 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.95 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.97 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22633) (66476, 22633) (322587, 22605) (66476, 22605) STARTING UNMASKING ROUND 269 eng cc unigrams 269 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.87 0.87 5000 ie 0.90 0.91 0.91 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.89 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22605) (66476, 22605) (322587, 22577) (66476, 22577) STARTING UNMASKING ROUND 270 eng cc unigrams 270 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.87 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.89 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.95 0.95 3788 us 0.90 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22577) (66476, 22577) (322587, 22549) (66476, 22549) STARTING UNMASKING ROUND 271 eng cc unigrams 271 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.88 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.95 0.95 3788 us 0.90 0.92 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22549) (66476, 22549) (322587, 22521) (66476, 22521) STARTING UNMASKING ROUND 272 eng cc unigrams 272 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.87 0.89 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.93 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22521) (66476, 22521) (322587, 22493) (66476, 22493) STARTING UNMASKING ROUND 273 eng cc unigrams 273 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.87 0.89 5000 ch 0.94 0.87 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.94 0.94 5000 my 0.92 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.96 0.94 0.95 3788 us 0.90 0.92 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22493) (66476, 22493) (322587, 22465) (66476, 22465) STARTING UNMASKING ROUND 274 eng cc unigrams 274 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.87 0.89 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.90 0.92 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22465) (66476, 22465) (322587, 22437) (66476, 22437) STARTING UNMASKING ROUND 275 eng cc unigrams 275 precision recall f1-score support au 0.93 0.95 0.94 5000 ca 0.90 0.87 0.88 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.91 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.84 0.86 5000 ph 0.95 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.90 0.92 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22437) (66476, 22437) (322587, 22410) (66476, 22410) STARTING UNMASKING ROUND 276 eng cc unigrams 276 precision recall f1-score support au 0.92 0.95 0.93 5000 ca 0.90 0.87 0.88 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.91 0.91 5000 ng 0.94 0.98 0.96 5000 nz 0.88 0.83 0.85 5000 ph 0.94 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.89 0.92 0.91 5000 za 0.87 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22410) (66476, 22410) (322587, 22382) (66476, 22382) STARTING UNMASKING ROUND 277 eng cc unigrams 277 precision recall f1-score support au 0.92 0.95 0.93 5000 ca 0.90 0.87 0.88 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.90 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.91 0.91 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.90 0.92 0.91 5000 za 0.86 0.93 0.90 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22382) (66476, 22382) (322587, 22354) (66476, 22354) STARTING UNMASKING ROUND 278 eng cc unigrams 278 precision recall f1-score support au 0.92 0.95 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.86 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.90 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.83 0.85 5000 ph 0.94 0.94 0.94 5000 pk 0.98 0.99 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.90 0.92 0.91 5000 za 0.86 0.93 0.89 5000 avg / total 0.92 0.92 0.92 66476 Reducing feature vectors. (322587, 22354) (66476, 22354) (322587, 22326) (66476, 22326) STARTING UNMASKING ROUND 279 eng cc unigrams 279 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.94 0.86 0.90 2688 gb 0.88 0.85 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.90 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.88 0.83 0.85 5000 ph 0.94 0.93 0.94 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.89 0.92 0.91 5000 za 0.86 0.93 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22326) (66476, 22326) (322587, 22298) (66476, 22298) STARTING UNMASKING ROUND 280 eng cc unigrams 280 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.86 0.89 2688 gb 0.88 0.85 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.90 0.91 5000 ng 0.95 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.93 0.94 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.90 0.92 0.91 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22298) (66476, 22298) (322587, 22270) (66476, 22270) STARTING UNMASKING ROUND 281 eng cc unigrams 281 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.86 0.89 2688 gb 0.88 0.85 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.93 0.94 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.90 0.92 0.91 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22270) (66476, 22270) (322587, 22243) (66476, 22243) STARTING UNMASKING ROUND 282 eng cc unigrams 282 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.86 0.89 2688 gb 0.88 0.85 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.91 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.93 0.94 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.92 0.91 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22243) (66476, 22243) (322587, 22216) (66476, 22216) STARTING UNMASKING ROUND 283 eng cc unigrams 283 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.85 0.89 2688 gb 0.88 0.85 0.87 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.91 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.92 0.90 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22216) (66476, 22216) (322587, 22188) (66476, 22188) STARTING UNMASKING ROUND 284 eng cc unigrams 284 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.85 0.89 2688 gb 0.87 0.85 0.86 5000 ie 0.89 0.90 0.90 5000 in 0.94 0.93 0.94 5000 my 0.91 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.83 0.85 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.92 0.90 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22188) (66476, 22188) (322587, 22160) (66476, 22160) STARTING UNMASKING ROUND 285 eng cc unigrams 285 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.82 0.85 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.89 0.92 0.90 5000 za 0.86 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22160) (66476, 22160) (322587, 22132) (66476, 22132) STARTING UNMASKING ROUND 286 eng cc unigrams 286 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.90 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.82 0.85 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.89 0.92 0.90 5000 za 0.85 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22132) (66476, 22132) (322587, 22104) (66476, 22104) STARTING UNMASKING ROUND 287 eng cc unigrams 287 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.87 0.88 5000 ch 0.93 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.98 0.96 5000 nz 0.87 0.82 0.84 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.88 0.92 0.90 5000 za 0.85 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22104) (66476, 22104) (322587, 22076) (66476, 22076) STARTING UNMASKING ROUND 288 eng cc unigrams 288 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.86 0.88 5000 ch 0.93 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.90 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.87 0.82 0.84 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.88 0.92 0.90 5000 za 0.85 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22076) (66476, 22076) (322587, 22048) (66476, 22048) STARTING UNMASKING ROUND 289 eng cc unigrams 289 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.86 0.88 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.89 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.82 0.84 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.95 3788 us 0.88 0.91 0.90 5000 za 0.85 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22048) (66476, 22048) (322587, 22020) (66476, 22020) STARTING UNMASKING ROUND 290 eng cc unigrams 290 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.86 0.88 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.88 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.87 0.82 0.84 5000 ph 0.93 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.89 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 22020) (66476, 22020) (322587, 21992) (66476, 21992) STARTING UNMASKING ROUND 291 eng cc unigrams 291 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.89 0.86 0.88 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.88 0.91 0.90 5000 in 0.94 0.93 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.87 0.82 0.84 5000 ph 0.93 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21992) (66476, 21992) (322587, 21964) (66476, 21964) STARTING UNMASKING ROUND 292 eng cc unigrams 292 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.85 0.86 5000 ie 0.88 0.91 0.90 5000 in 0.94 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.82 0.84 5000 ph 0.93 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21964) (66476, 21964) (322587, 21936) (66476, 21936) STARTING UNMASKING ROUND 293 eng cc unigrams 293 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.94 0.85 0.89 2688 gb 0.88 0.84 0.86 5000 ie 0.88 0.91 0.90 5000 in 0.94 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.82 0.84 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21936) (66476, 21936) (322587, 21908) (66476, 21908) STARTING UNMASKING ROUND 294 eng cc unigrams 294 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.89 2688 gb 0.88 0.84 0.86 5000 ie 0.88 0.90 0.89 5000 in 0.94 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.82 0.84 5000 ph 0.94 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.94 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21908) (66476, 21908) (322587, 21880) (66476, 21880) STARTING UNMASKING ROUND 295 eng cc unigrams 295 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.89 2688 gb 0.88 0.84 0.86 5000 ie 0.88 0.90 0.89 5000 in 0.94 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.82 0.84 5000 ph 0.93 0.93 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.89 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21880) (66476, 21880) (322587, 21853) (66476, 21853) STARTING UNMASKING ROUND 296 eng cc unigrams 296 precision recall f1-score support au 0.92 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.94 0.84 0.89 2688 gb 0.87 0.84 0.86 5000 ie 0.88 0.90 0.89 5000 in 0.93 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21853) (66476, 21853) (322587, 21825) (66476, 21825) STARTING UNMASKING ROUND 297 eng cc unigrams 297 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.89 2688 gb 0.88 0.84 0.86 5000 ie 0.88 0.90 0.89 5000 in 0.93 0.92 0.93 5000 my 0.90 0.89 0.90 5000 ng 0.94 0.97 0.96 5000 nz 0.87 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.91 0.90 5000 za 0.85 0.92 0.88 5000 avg / total 0.91 0.91 0.91 66476 Reducing feature vectors. (322587, 21825) (66476, 21825) (322587, 21797) (66476, 21797) STARTING UNMASKING ROUND 298 eng cc unigrams 298 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.93 5000 my 0.90 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.91 0.89 5000 za 0.85 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21797) (66476, 21797) (322587, 21769) (66476, 21769) STARTING UNMASKING ROUND 299 eng cc unigrams 299 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.88 2688 gb 0.87 0.84 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.93 5000 my 0.90 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.91 0.89 5000 za 0.85 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21769) (66476, 21769) (322587, 21742) (66476, 21742) STARTING UNMASKING ROUND 300 eng cc unigrams 300 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.84 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.93 5000 my 0.90 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21742) (66476, 21742) (322587, 21714) (66476, 21714) STARTING UNMASKING ROUND 301 eng cc unigrams 301 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.86 0.87 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.84 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.90 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21714) (66476, 21714) (322587, 21686) (66476, 21686) STARTING UNMASKING ROUND 302 eng cc unigrams 302 precision recall f1-score support au 0.91 0.94 0.93 5000 ca 0.88 0.85 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.84 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.90 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21686) (66476, 21686) (322587, 21658) (66476, 21658) STARTING UNMASKING ROUND 303 eng cc unigrams 303 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.94 0.97 0.96 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.85 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21658) (66476, 21658) (322587, 21632) (66476, 21632) STARTING UNMASKING ROUND 304 eng cc unigrams 304 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.88 0.85 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.94 0.97 0.95 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.93 5000 pk 0.97 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.85 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21632) (66476, 21632) (322587, 21604) (66476, 21604) STARTING UNMASKING ROUND 305 eng cc unigrams 305 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.86 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.94 0.97 0.95 5000 nz 0.86 0.81 0.84 5000 ph 0.93 0.92 0.93 5000 pk 0.97 0.98 0.98 5000 pt 0.95 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.85 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21604) (66476, 21604) (322587, 21576) (66476, 21576) STARTING UNMASKING ROUND 306 eng cc unigrams 306 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.86 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.94 0.97 0.95 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.93 5000 pk 0.98 0.98 0.98 5000 pt 0.94 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21576) (66476, 21576) (322587, 21548) (66476, 21548) STARTING UNMASKING ROUND 307 eng cc unigrams 307 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.94 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.89 5000 in 0.93 0.92 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.94 0.97 0.95 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21548) (66476, 21548) (322587, 21520) (66476, 21520) STARTING UNMASKING ROUND 308 eng cc unigrams 308 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.83 0.88 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.81 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.88 0.90 0.89 5000 za 0.84 0.92 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21520) (66476, 21520) (322587, 21492) (66476, 21492) STARTING UNMASKING ROUND 309 eng cc unigrams 309 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.83 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.87 0.90 0.88 5000 in 0.93 0.92 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.94 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.94 3788 us 0.88 0.90 0.89 5000 za 0.84 0.91 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21492) (66476, 21492) (322587, 21465) (66476, 21465) STARTING UNMASKING ROUND 310 eng cc unigrams 310 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.82 0.87 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.90 0.88 5000 za 0.84 0.91 0.88 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21465) (66476, 21465) (322587, 21437) (66476, 21437) STARTING UNMASKING ROUND 311 eng cc unigrams 311 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.82 0.87 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.89 0.88 5000 za 0.84 0.91 0.87 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21437) (66476, 21437) (322587, 21409) (66476, 21409) STARTING UNMASKING ROUND 312 eng cc unigrams 312 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.92 0.82 0.87 2688 gb 0.87 0.83 0.85 5000 ie 0.87 0.90 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.94 3788 us 0.87 0.89 0.88 5000 za 0.84 0.91 0.87 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21409) (66476, 21409) (322587, 21381) (66476, 21381) STARTING UNMASKING ROUND 313 eng cc unigrams 313 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.92 0.82 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.87 0.89 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.89 0.88 5000 za 0.84 0.91 0.87 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21381) (66476, 21381) (322587, 21354) (66476, 21354) STARTING UNMASKING ROUND 314 eng cc unigrams 314 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.87 0.85 0.86 5000 ch 0.93 0.82 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.87 0.89 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.86 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.89 0.88 5000 za 0.84 0.91 0.87 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21354) (66476, 21354) (322587, 21326) (66476, 21326) STARTING UNMASKING ROUND 315 eng cc unigrams 315 precision recall f1-score support au 0.91 0.94 0.92 5000 ca 0.86 0.85 0.85 5000 ch 0.92 0.82 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.87 0.89 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.89 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.89 0.88 5000 za 0.83 0.91 0.87 5000 avg / total 0.90 0.90 0.90 66476 Reducing feature vectors. (322587, 21326) (66476, 21326) (322587, 21299) (66476, 21299) STARTING UNMASKING ROUND 316 eng cc unigrams 316 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.81 0.87 2688 gb 0.86 0.83 0.84 5000 ie 0.86 0.89 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.89 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.80 0.83 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.92 0.93 3788 us 0.87 0.89 0.88 5000 za 0.83 0.91 0.87 5000 avg / total 0.90 0.90 0.89 66476 Reducing feature vectors. (322587, 21299) (66476, 21299) (322587, 21272) (66476, 21272) STARTING UNMASKING ROUND 317 eng cc unigrams 317 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.82 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.86 0.89 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.88 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.79 0.82 5000 ph 0.93 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.89 0.88 5000 za 0.83 0.91 0.87 5000 avg / total 0.90 0.89 0.89 66476 Reducing feature vectors. (322587, 21272) (66476, 21272) (322587, 21245) (66476, 21245) STARTING UNMASKING ROUND 318 eng cc unigrams 318 precision recall f1-score support au 0.90 0.94 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.87 2688 gb 0.86 0.83 0.85 5000 ie 0.86 0.90 0.88 5000 in 0.93 0.91 0.92 5000 my 0.89 0.88 0.88 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.92 0.93 3788 us 0.87 0.89 0.88 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21245) (66476, 21245) (322587, 21217) (66476, 21217) STARTING UNMASKING ROUND 319 eng cc unigrams 319 precision recall f1-score support au 0.90 0.93 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.82 0.87 2688 gb 0.86 0.82 0.84 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.92 5000 my 0.88 0.88 0.88 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.93 0.93 3788 us 0.87 0.88 0.88 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21217) (66476, 21217) (322587, 21189) (66476, 21189) STARTING UNMASKING ROUND 320 eng cc unigrams 320 precision recall f1-score support au 0.90 0.93 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.87 2688 gb 0.85 0.82 0.84 5000 ie 0.86 0.89 0.88 5000 in 0.92 0.91 0.92 5000 my 0.88 0.88 0.88 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.92 0.92 5000 pk 0.97 0.98 0.98 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21189) (66476, 21189) (322587, 21161) (66476, 21161) STARTING UNMASKING ROUND 321 eng cc unigrams 321 precision recall f1-score support au 0.90 0.93 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.87 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.92 5000 my 0.88 0.88 0.88 5000 ng 0.93 0.97 0.95 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21161) (66476, 21161) (322587, 21133) (66476, 21133) STARTING UNMASKING ROUND 322 eng cc unigrams 322 precision recall f1-score support au 0.90 0.93 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.87 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.97 0.94 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21133) (66476, 21133) (322587, 21105) (66476, 21105) STARTING UNMASKING ROUND 323 eng cc unigrams 323 precision recall f1-score support au 0.90 0.93 0.92 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.82 0.87 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.97 0.94 5000 nz 0.86 0.79 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21105) (66476, 21105) (322587, 21078) (66476, 21078) STARTING UNMASKING ROUND 324 eng cc unigrams 324 precision recall f1-score support au 0.90 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.97 0.94 5000 nz 0.85 0.79 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.90 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21078) (66476, 21078) (322587, 21050) (66476, 21050) STARTING UNMASKING ROUND 325 eng cc unigrams 325 precision recall f1-score support au 0.90 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.81 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21050) (66476, 21050) (322587, 21022) (66476, 21022) STARTING UNMASKING ROUND 326 eng cc unigrams 326 precision recall f1-score support au 0.90 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.81 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.86 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 21022) (66476, 21022) (322587, 20994) (66476, 20994) STARTING UNMASKING ROUND 327 eng cc unigrams 327 precision recall f1-score support au 0.90 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.81 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.92 0.90 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.82 5000 ph 0.92 0.91 0.92 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.87 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 20994) (66476, 20994) (322587, 20966) (66476, 20966) STARTING UNMASKING ROUND 328 eng cc unigrams 328 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.92 0.81 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.92 0.90 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.91 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 20966) (66476, 20966) (322587, 20938) (66476, 20938) STARTING UNMASKING ROUND 329 eng cc unigrams 329 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.86 0.84 0.85 5000 ch 0.93 0.80 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.90 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 20938) (66476, 20938) (322587, 20911) (66476, 20911) STARTING UNMASKING ROUND 330 eng cc unigrams 330 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.93 0.80 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.92 0.91 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.94 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.83 0.90 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 20911) (66476, 20911) (322587, 20883) (66476, 20883) STARTING UNMASKING ROUND 331 eng cc unigrams 331 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.84 0.84 5000 ch 0.92 0.80 0.86 2688 gb 0.85 0.81 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.92 0.90 0.91 5000 my 0.88 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.82 0.90 0.86 5000 avg / total 0.89 0.89 0.89 66476 Reducing feature vectors. (322587, 20883) (66476, 20883) (322587, 20855) (66476, 20855) STARTING UNMASKING ROUND 332 eng cc unigrams 332 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.86 2688 gb 0.85 0.80 0.83 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.88 0.88 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.78 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.86 0.88 0.87 5000 za 0.82 0.90 0.86 5000 avg / total 0.89 0.89 0.88 66476 Reducing feature vectors. (322587, 20855) (66476, 20855) (322587, 20827) (66476, 20827) STARTING UNMASKING ROUND 333 eng cc unigrams 333 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.86 2688 gb 0.85 0.80 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.92 0.93 3788 us 0.85 0.88 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20827) (66476, 20827) (322587, 20799) (66476, 20799) STARTING UNMASKING ROUND 334 eng cc unigrams 334 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.86 2688 gb 0.85 0.80 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20799) (66476, 20799) (322587, 20771) (66476, 20771) STARTING UNMASKING ROUND 335 eng cc unigrams 335 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.85 2688 gb 0.85 0.80 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20771) (66476, 20771) (322587, 20744) (66476, 20744) STARTING UNMASKING ROUND 336 eng cc unigrams 336 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.86 2688 gb 0.85 0.79 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.92 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.92 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.88 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20744) (66476, 20744) (322587, 20716) (66476, 20716) STARTING UNMASKING ROUND 337 eng cc unigrams 337 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.80 0.85 2688 gb 0.85 0.79 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20716) (66476, 20716) (322587, 20688) (66476, 20688) STARTING UNMASKING ROUND 338 eng cc unigrams 338 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.91 0.80 0.85 2688 gb 0.84 0.79 0.82 5000 ie 0.85 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20688) (66476, 20688) (322587, 20660) (66476, 20660) STARTING UNMASKING ROUND 339 eng cc unigrams 339 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.91 0.79 0.85 2688 gb 0.85 0.79 0.82 5000 ie 0.84 0.89 0.87 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20660) (66476, 20660) (322587, 20632) (66476, 20632) STARTING UNMASKING ROUND 340 eng cc unigrams 340 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.91 0.79 0.85 2688 gb 0.84 0.79 0.82 5000 ie 0.84 0.89 0.86 5000 in 0.91 0.90 0.91 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.85 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20632) (66476, 20632) (322587, 20604) (66476, 20604) STARTING UNMASKING ROUND 341 eng cc unigrams 341 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.92 0.79 0.85 2688 gb 0.84 0.79 0.82 5000 ie 0.84 0.89 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20604) (66476, 20604) (322587, 20576) (66476, 20576) STARTING UNMASKING ROUND 342 eng cc unigrams 342 precision recall f1-score support au 0.89 0.93 0.91 5000 ca 0.85 0.83 0.84 5000 ch 0.91 0.79 0.85 2688 gb 0.84 0.79 0.81 5000 ie 0.84 0.89 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.77 0.81 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.85 0.87 0.86 5000 za 0.82 0.90 0.86 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20576) (66476, 20576) (322587, 20548) (66476, 20548) STARTING UNMASKING ROUND 343 eng cc unigrams 343 precision recall f1-score support au 0.88 0.93 0.90 5000 ca 0.85 0.83 0.84 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.79 0.81 5000 ie 0.84 0.89 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.94 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.91 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.87 0.86 5000 za 0.81 0.90 0.85 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20548) (66476, 20548) (322587, 20520) (66476, 20520) STARTING UNMASKING ROUND 344 eng cc unigrams 344 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.84 0.88 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.90 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.87 0.86 5000 za 0.81 0.90 0.85 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20520) (66476, 20520) (322587, 20492) (66476, 20492) STARTING UNMASKING ROUND 345 eng cc unigrams 345 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.84 0.89 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.90 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.87 0.85 5000 za 0.81 0.90 0.85 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20492) (66476, 20492) (322587, 20464) (66476, 20464) STARTING UNMASKING ROUND 346 eng cc unigrams 346 precision recall f1-score support au 0.88 0.93 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.90 0.91 5000 pk 0.97 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.87 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.88 0.88 0.88 66476 Reducing feature vectors. (322587, 20464) (66476, 20464) (322587, 20436) (66476, 20436) STARTING UNMASKING ROUND 347 eng cc unigrams 347 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.90 0.90 5000 pk 0.96 0.98 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.87 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.88 0.88 0.87 66476 Reducing feature vectors. (322587, 20436) (66476, 20436) (322587, 20408) (66476, 20408) STARTING UNMASKING ROUND 348 eng cc unigrams 348 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.86 5000 in 0.91 0.89 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.91 0.90 0.90 5000 pk 0.97 0.97 0.97 5000 pt 0.94 0.91 0.92 3788 us 0.84 0.87 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.88 0.87 0.87 66476 Reducing feature vectors. (322587, 20408) (66476, 20408) (322587, 20380) (66476, 20380) STARTING UNMASKING ROUND 349 eng cc unigrams 349 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.78 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.86 5000 in 0.91 0.90 0.90 5000 my 0.87 0.86 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20380) (66476, 20380) (322587, 20352) (66476, 20352) STARTING UNMASKING ROUND 350 eng cc unigrams 350 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.77 0.84 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.85 5000 in 0.91 0.89 0.90 5000 my 0.87 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20352) (66476, 20352) (322587, 20324) (66476, 20324) STARTING UNMASKING ROUND 351 eng cc unigrams 351 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.91 0.77 0.83 2688 gb 0.84 0.78 0.81 5000 ie 0.83 0.88 0.85 5000 in 0.91 0.89 0.90 5000 my 0.87 0.87 0.87 5000 ng 0.91 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20324) (66476, 20324) (322587, 20296) (66476, 20296) STARTING UNMASKING ROUND 352 eng cc unigrams 352 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.82 0.83 5000 ch 0.90 0.77 0.83 2688 gb 0.84 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.90 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20296) (66476, 20296) (322587, 20268) (66476, 20268) STARTING UNMASKING ROUND 353 eng cc unigrams 353 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.81 0.83 5000 ch 0.91 0.77 0.83 2688 gb 0.84 0.77 0.80 5000 ie 0.83 0.88 0.85 5000 in 0.90 0.89 0.90 5000 my 0.87 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20268) (66476, 20268) (322587, 20240) (66476, 20240) STARTING UNMASKING ROUND 354 eng cc unigrams 354 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.90 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20240) (66476, 20240) (322587, 20212) (66476, 20212) STARTING UNMASKING ROUND 355 eng cc unigrams 355 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.84 0.81 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.90 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.91 0.92 3788 us 0.83 0.86 0.84 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20212) (66476, 20212) (322587, 20184) (66476, 20184) STARTING UNMASKING ROUND 356 eng cc unigrams 356 precision recall f1-score support au 0.88 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.85 0.76 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20184) (66476, 20184) (322587, 20156) (66476, 20156) STARTING UNMASKING ROUND 357 eng cc unigrams 357 precision recall f1-score support au 0.87 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.90 0.76 0.83 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.75 0.80 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.90 0.92 3788 us 0.84 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20156) (66476, 20156) (322587, 20128) (66476, 20128) STARTING UNMASKING ROUND 358 eng cc unigrams 358 precision recall f1-score support au 0.87 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.90 0.76 0.82 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.85 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.75 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.86 0.85 5000 za 0.81 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20128) (66476, 20128) (322587, 20100) (66476, 20100) STARTING UNMASKING ROUND 359 eng cc unigrams 359 precision recall f1-score support au 0.87 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.91 0.76 0.82 2688 gb 0.83 0.77 0.80 5000 ie 0.82 0.88 0.84 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.75 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.97 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.86 0.85 5000 za 0.80 0.89 0.85 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20100) (66476, 20100) (322587, 20072) (66476, 20072) STARTING UNMASKING ROUND 360 eng cc unigrams 360 precision recall f1-score support au 0.87 0.92 0.90 5000 ca 0.83 0.81 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.76 0.80 5000 ie 0.81 0.87 0.84 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.75 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.91 3788 us 0.83 0.86 0.85 5000 za 0.80 0.89 0.84 5000 avg / total 0.87 0.87 0.87 66476 Reducing feature vectors. (322587, 20072) (66476, 20072) (322587, 20044) (66476, 20044) STARTING UNMASKING ROUND 361 eng cc unigrams 361 precision recall f1-score support au 0.87 0.92 0.89 5000 ca 0.83 0.80 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.82 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.74 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.85 0.84 5000 za 0.80 0.89 0.84 5000 avg / total 0.87 0.87 0.86 66476 Reducing feature vectors. (322587, 20044) (66476, 20044) (322587, 20016) (66476, 20016) STARTING UNMASKING ROUND 362 eng cc unigrams 362 precision recall f1-score support au 0.87 0.92 0.89 5000 ca 0.83 0.80 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.75 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.85 0.84 5000 za 0.80 0.88 0.84 5000 avg / total 0.87 0.87 0.86 66476 Reducing feature vectors. (322587, 20016) (66476, 20016) (322587, 19988) (66476, 19988) STARTING UNMASKING ROUND 363 eng cc unigrams 363 precision recall f1-score support au 0.87 0.92 0.89 5000 ca 0.83 0.80 0.82 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.74 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.85 0.84 5000 za 0.80 0.89 0.84 5000 avg / total 0.87 0.86 0.86 66476 Reducing feature vectors. (322587, 19988) (66476, 19988) (322587, 19960) (66476, 19960) STARTING UNMASKING ROUND 364 eng cc unigrams 364 precision recall f1-score support au 0.87 0.92 0.89 5000 ca 0.83 0.80 0.81 5000 ch 0.91 0.76 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.90 0.89 0.89 5000 my 0.86 0.86 0.86 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.74 0.79 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.85 0.84 5000 za 0.80 0.89 0.84 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19960) (66476, 19960) (322587, 19932) (66476, 19932) STARTING UNMASKING ROUND 365 eng cc unigrams 365 precision recall f1-score support au 0.87 0.92 0.89 5000 ca 0.83 0.80 0.81 5000 ch 0.91 0.75 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.88 0.84 5000 in 0.90 0.88 0.89 5000 my 0.85 0.86 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.83 0.74 0.78 5000 ph 0.90 0.90 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.93 0.90 0.92 3788 us 0.83 0.85 0.84 5000 za 0.80 0.88 0.84 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19932) (66476, 19932) (322587, 19905) (66476, 19905) STARTING UNMASKING ROUND 366 eng cc unigrams 366 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.83 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.90 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.73 0.78 5000 ph 0.90 0.89 0.90 5000 pk 0.96 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.83 0.85 0.84 5000 za 0.79 0.88 0.84 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19905) (66476, 19905) (322587, 19877) (66476, 19877) STARTING UNMASKING ROUND 367 eng cc unigrams 367 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.76 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.84 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.96 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.83 0.85 0.84 5000 za 0.79 0.88 0.84 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19877) (66476, 19877) (322587, 19851) (66476, 19851) STARTING UNMASKING ROUND 368 eng cc unigrams 368 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.83 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.96 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.83 0.85 0.84 5000 za 0.79 0.88 0.84 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19851) (66476, 19851) (322587, 19823) (66476, 19823) STARTING UNMASKING ROUND 369 eng cc unigrams 369 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.96 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.83 0.84 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19823) (66476, 19823) (322587, 19795) (66476, 19795) STARTING UNMASKING ROUND 370 eng cc unigrams 370 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.83 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.85 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19795) (66476, 19795) (322587, 19767) (66476, 19767) STARTING UNMASKING ROUND 371 eng cc unigrams 371 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.85 0.85 0.85 5000 ng 0.90 0.96 0.93 5000 nz 0.83 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.85 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19767) (66476, 19767) (322587, 19739) (66476, 19739) STARTING UNMASKING ROUND 372 eng cc unigrams 372 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.88 0.84 5000 in 0.89 0.88 0.89 5000 my 0.84 0.85 0.85 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.73 0.78 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.83 0.84 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19739) (66476, 19739) (322587, 19711) (66476, 19711) STARTING UNMASKING ROUND 373 eng cc unigrams 373 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.75 0.82 2688 gb 0.83 0.75 0.79 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.89 5000 my 0.84 0.85 0.84 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.72 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.84 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19711) (66476, 19711) (322587, 19684) (66476, 19684) STARTING UNMASKING ROUND 374 eng cc unigrams 374 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.81 0.80 0.81 5000 ch 0.91 0.74 0.82 2688 gb 0.83 0.74 0.78 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.72 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.84 0.83 5000 za 0.79 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19684) (66476, 19684) (322587, 19656) (66476, 19656) STARTING UNMASKING ROUND 375 eng cc unigrams 375 precision recall f1-score support au 0.86 0.92 0.89 5000 ca 0.82 0.80 0.81 5000 ch 0.91 0.74 0.82 2688 gb 0.83 0.74 0.78 5000 ie 0.81 0.87 0.84 5000 in 0.89 0.88 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.72 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.86 0.86 0.86 66476 Reducing feature vectors. (322587, 19656) (66476, 19656) (322587, 19628) (66476, 19628) STARTING UNMASKING ROUND 376 eng cc unigrams 376 precision recall f1-score support au 0.86 0.92 0.88 5000 ca 0.82 0.79 0.80 5000 ch 0.91 0.74 0.81 2688 gb 0.82 0.73 0.78 5000 ie 0.80 0.87 0.84 5000 in 0.89 0.88 0.88 5000 my 0.84 0.85 0.84 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.72 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.86 0.86 0.85 66476 Reducing feature vectors. (322587, 19628) (66476, 19628) (322587, 19600) (66476, 19600) STARTING UNMASKING ROUND 377 eng cc unigrams 377 precision recall f1-score support au 0.85 0.92 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.83 0.73 0.78 5000 ie 0.80 0.87 0.83 5000 in 0.89 0.88 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.89 0.96 0.92 5000 nz 0.83 0.71 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.90 0.91 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19600) (66476, 19600) (322587, 19574) (66476, 19574) STARTING UNMASKING ROUND 378 eng cc unigrams 378 precision recall f1-score support au 0.85 0.92 0.88 5000 ca 0.82 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.73 0.77 5000 ie 0.80 0.87 0.83 5000 in 0.89 0.88 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.88 0.96 0.92 5000 nz 0.83 0.71 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.91 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19574) (66476, 19574) (322587, 19546) (66476, 19546) STARTING UNMASKING ROUND 379 eng cc unigrams 379 precision recall f1-score support au 0.85 0.92 0.88 5000 ca 0.82 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.73 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.88 0.96 0.92 5000 nz 0.83 0.71 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19546) (66476, 19546) (322587, 19518) (66476, 19518) STARTING UNMASKING ROUND 380 eng cc unigrams 380 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.82 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.73 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.88 0.96 0.92 5000 nz 0.83 0.71 0.76 5000 ph 0.88 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.91 3788 us 0.82 0.84 0.83 5000 za 0.78 0.88 0.83 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19518) (66476, 19518) (322587, 19490) (66476, 19490) STARTING UNMASKING ROUND 381 eng cc unigrams 381 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.73 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.84 0.84 0.84 5000 ng 0.88 0.96 0.92 5000 nz 0.83 0.71 0.77 5000 ph 0.89 0.89 0.89 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.81 0.84 0.83 5000 za 0.78 0.88 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19490) (66476, 19490) (322587, 19462) (66476, 19462) STARTING UNMASKING ROUND 382 eng cc unigrams 382 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.72 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.83 0.84 0.83 5000 ng 0.88 0.96 0.92 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.91 3788 us 0.81 0.84 0.82 5000 za 0.78 0.88 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19462) (66476, 19462) (322587, 19434) (66476, 19434) STARTING UNMASKING ROUND 383 eng cc unigrams 383 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.72 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.83 0.84 0.84 5000 ng 0.88 0.95 0.92 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.91 3788 us 0.81 0.84 0.82 5000 za 0.78 0.87 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19434) (66476, 19434) (322587, 19406) (66476, 19406) STARTING UNMASKING ROUND 384 eng cc unigrams 384 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.91 0.73 0.81 2688 gb 0.82 0.72 0.77 5000 ie 0.79 0.87 0.82 5000 in 0.89 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.81 0.84 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19406) (66476, 19406) (322587, 19378) (66476, 19378) STARTING UNMASKING ROUND 385 eng cc unigrams 385 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.81 0.79 0.80 5000 ch 0.90 0.72 0.80 2688 gb 0.82 0.72 0.77 5000 ie 0.79 0.87 0.82 5000 in 0.89 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19378) (66476, 19378) (322587, 19350) (66476, 19350) STARTING UNMASKING ROUND 386 eng cc unigrams 386 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.80 0.79 0.80 5000 ch 0.90 0.72 0.80 2688 gb 0.82 0.72 0.77 5000 ie 0.78 0.87 0.82 5000 in 0.89 0.87 0.88 5000 my 0.83 0.84 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19350) (66476, 19350) (322587, 19322) (66476, 19322) STARTING UNMASKING ROUND 387 eng cc unigrams 387 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.80 0.79 0.80 5000 ch 0.90 0.72 0.80 2688 gb 0.82 0.72 0.77 5000 ie 0.79 0.87 0.83 5000 in 0.89 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.92 5000 nz 0.83 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.92 0.89 0.90 3788 us 0.82 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.85 66476 Reducing feature vectors. (322587, 19322) (66476, 19322) (322587, 19294) (66476, 19294) STARTING UNMASKING ROUND 388 eng cc unigrams 388 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.80 0.79 0.79 5000 ch 0.90 0.72 0.80 2688 gb 0.82 0.72 0.77 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.92 5000 nz 0.82 0.70 0.76 5000 ph 0.88 0.89 0.88 5000 pk 0.95 0.97 0.96 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.84 66476 Reducing feature vectors. (322587, 19294) (66476, 19294) (322587, 19266) (66476, 19266) STARTING UNMASKING ROUND 389 eng cc unigrams 389 precision recall f1-score support au 0.85 0.91 0.88 5000 ca 0.80 0.79 0.80 5000 ch 0.90 0.72 0.80 2688 gb 0.82 0.72 0.77 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.92 5000 nz 0.82 0.70 0.75 5000 ph 0.88 0.88 0.88 5000 pk 0.95 0.96 0.96 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.85 0.84 66476 Reducing feature vectors. (322587, 19266) (66476, 19266) (322587, 19238) (66476, 19238) STARTING UNMASKING ROUND 390 eng cc unigrams 390 precision recall f1-score support au 0.84 0.91 0.88 5000 ca 0.80 0.79 0.79 5000 ch 0.90 0.71 0.80 2688 gb 0.82 0.72 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.82 0.69 0.75 5000 ph 0.88 0.88 0.88 5000 pk 0.95 0.96 0.96 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.85 0.84 0.84 66476 Reducing feature vectors. (322587, 19238) (66476, 19238) (322587, 19210) (66476, 19210) STARTING UNMASKING ROUND 391 eng cc unigrams 391 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.79 0.79 5000 ch 0.90 0.71 0.79 2688 gb 0.82 0.71 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.88 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.82 0.68 0.75 5000 ph 0.87 0.88 0.88 5000 pk 0.95 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.82 0.83 0.82 5000 za 0.77 0.87 0.82 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19210) (66476, 19210) (322587, 19182) (66476, 19182) STARTING UNMASKING ROUND 392 eng cc unigrams 392 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.79 0.79 5000 ch 0.90 0.71 0.79 2688 gb 0.82 0.71 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.87 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.82 0.69 0.75 5000 ph 0.88 0.88 0.88 5000 pk 0.95 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.82 0.83 0.82 5000 za 0.77 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19182) (66476, 19182) (322587, 19154) (66476, 19154) STARTING UNMASKING ROUND 393 eng cc unigrams 393 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.79 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.71 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.87 5000 my 0.83 0.83 0.83 5000 ng 0.88 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.95 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19154) (66476, 19154) (322587, 19126) (66476, 19126) STARTING UNMASKING ROUND 394 eng cc unigrams 394 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.78 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.71 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.87 5000 my 0.83 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.75 5000 ph 0.88 0.88 0.88 5000 pk 0.95 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19126) (66476, 19126) (322587, 19098) (66476, 19098) STARTING UNMASKING ROUND 395 eng cc unigrams 395 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.78 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.71 0.76 5000 ie 0.78 0.87 0.82 5000 in 0.88 0.87 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.88 0.88 0.88 5000 pk 0.95 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.77 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19098) (66476, 19098) (322587, 19070) (66476, 19070) STARTING UNMASKING ROUND 396 eng cc unigrams 396 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.78 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.71 0.76 5000 ie 0.78 0.86 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19070) (66476, 19070) (322587, 19042) (66476, 19042) STARTING UNMASKING ROUND 397 eng cc unigrams 397 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.79 0.78 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.70 0.76 5000 ie 0.77 0.86 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.83 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19042) (66476, 19042) (322587, 19015) (66476, 19015) STARTING UNMASKING ROUND 398 eng cc unigrams 398 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.79 0.78 0.79 5000 ch 0.90 0.70 0.79 2688 gb 0.81 0.71 0.76 5000 ie 0.77 0.87 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.81 0.82 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 19015) (66476, 19015) (322587, 18987) (66476, 18987) STARTING UNMASKING ROUND 399 eng cc unigrams 399 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.80 0.78 0.79 5000 ch 0.90 0.70 0.78 2688 gb 0.81 0.70 0.75 5000 ie 0.77 0.87 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.81 0.83 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 18987) (66476, 18987) (322587, 18960) (66476, 18960) STARTING UNMASKING ROUND 400 eng cc unigrams 400 precision recall f1-score support au 0.84 0.91 0.87 5000 ca 0.79 0.78 0.79 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.70 0.75 5000 ie 0.77 0.86 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.68 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.81 0.83 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 18960) (66476, 18960) (322587, 18932) (66476, 18932) STARTING UNMASKING ROUND 401 eng cc unigrams 401 precision recall f1-score support au 0.83 0.91 0.87 5000 ca 0.79 0.78 0.79 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.70 0.75 5000 ie 0.77 0.86 0.82 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.67 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.82 0.82 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 18932) (66476, 18932) (322587, 18904) (66476, 18904) STARTING UNMASKING ROUND 402 eng cc unigrams 402 precision recall f1-score support au 0.83 0.90 0.87 5000 ca 0.79 0.78 0.78 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.69 0.75 5000 ie 0.77 0.86 0.81 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.83 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.67 0.74 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.81 0.82 0.81 5000 za 0.76 0.86 0.81 5000 avg / total 0.84 0.84 0.84 66476 Reducing feature vectors. (322587, 18904) (66476, 18904) (322587, 18877) (66476, 18877) STARTING UNMASKING ROUND 403 eng cc unigrams 403 precision recall f1-score support au 0.83 0.90 0.87 5000 ca 0.79 0.78 0.78 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.69 0.75 5000 ie 0.77 0.86 0.81 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.67 0.73 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.80 0.82 0.81 5000 za 0.76 0.87 0.81 5000 avg / total 0.84 0.84 0.83 66476 Reducing feature vectors. (322587, 18877) (66476, 18877) (322587, 18849) (66476, 18849) STARTING UNMASKING ROUND 404 eng cc unigrams 404 precision recall f1-score support au 0.83 0.91 0.87 5000 ca 0.79 0.78 0.78 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.69 0.74 5000 ie 0.77 0.86 0.81 5000 in 0.87 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.87 0.95 0.91 5000 nz 0.82 0.66 0.73 5000 ph 0.87 0.88 0.88 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.89 3788 us 0.81 0.82 0.81 5000 za 0.75 0.87 0.81 5000 avg / total 0.84 0.83 0.83 66476 Reducing feature vectors. (322587, 18849) (66476, 18849) (322587, 18822) (66476, 18822) STARTING UNMASKING ROUND 405 eng cc unigrams 405 precision recall f1-score support au 0.83 0.90 0.87 5000 ca 0.79 0.77 0.78 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.69 0.74 5000 ie 0.76 0.86 0.81 5000 in 0.87 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.87 0.95 0.91 5000 nz 0.81 0.66 0.73 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.90 3788 us 0.80 0.82 0.81 5000 za 0.75 0.87 0.81 5000 avg / total 0.84 0.83 0.83 66476 Reducing feature vectors. (322587, 18822) (66476, 18822) (322587, 18794) (66476, 18794) STARTING UNMASKING ROUND 406 eng cc unigrams 406 precision recall f1-score support au 0.83 0.91 0.87 5000 ca 0.79 0.77 0.78 5000 ch 0.90 0.69 0.78 2688 gb 0.81 0.68 0.74 5000 ie 0.76 0.87 0.81 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.86 0.95 0.90 5000 nz 0.81 0.66 0.73 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.81 0.82 0.81 5000 za 0.75 0.87 0.81 5000 avg / total 0.84 0.83 0.83 66476 Reducing feature vectors. (322587, 18794) (66476, 18794) (322587, 18766) (66476, 18766) STARTING UNMASKING ROUND 407 eng cc unigrams 407 precision recall f1-score support au 0.83 0.91 0.86 5000 ca 0.79 0.77 0.78 5000 ch 0.90 0.68 0.78 2688 gb 0.81 0.68 0.74 5000 ie 0.76 0.87 0.81 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.87 0.95 0.91 5000 nz 0.81 0.66 0.73 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.89 0.90 3788 us 0.80 0.82 0.81 5000 za 0.76 0.87 0.81 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18766) (66476, 18766) (322587, 18738) (66476, 18738) STARTING UNMASKING ROUND 408 eng cc unigrams 408 precision recall f1-score support au 0.82 0.90 0.86 5000 ca 0.79 0.77 0.78 5000 ch 0.90 0.68 0.78 2688 gb 0.81 0.67 0.74 5000 ie 0.76 0.87 0.81 5000 in 0.87 0.85 0.86 5000 my 0.82 0.83 0.82 5000 ng 0.87 0.95 0.91 5000 nz 0.81 0.66 0.73 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.90 0.89 0.89 3788 us 0.80 0.82 0.81 5000 za 0.75 0.86 0.81 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18738) (66476, 18738) (322587, 18710) (66476, 18710) STARTING UNMASKING ROUND 409 eng cc unigrams 409 precision recall f1-score support au 0.82 0.90 0.86 5000 ca 0.78 0.77 0.78 5000 ch 0.90 0.68 0.77 2688 gb 0.81 0.67 0.74 5000 ie 0.76 0.86 0.81 5000 in 0.87 0.86 0.86 5000 my 0.82 0.83 0.82 5000 ng 0.86 0.95 0.90 5000 nz 0.82 0.65 0.73 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.89 3788 us 0.80 0.81 0.81 5000 za 0.75 0.87 0.80 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18710) (66476, 18710) (322587, 18682) (66476, 18682) STARTING UNMASKING ROUND 410 eng cc unigrams 410 precision recall f1-score support au 0.82 0.90 0.86 5000 ca 0.78 0.77 0.78 5000 ch 0.89 0.68 0.77 2688 gb 0.81 0.67 0.74 5000 ie 0.76 0.87 0.81 5000 in 0.87 0.85 0.86 5000 my 0.82 0.82 0.82 5000 ng 0.86 0.95 0.90 5000 nz 0.81 0.65 0.72 5000 ph 0.87 0.88 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.91 0.88 0.89 3788 us 0.80 0.81 0.81 5000 za 0.75 0.87 0.81 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18682) (66476, 18682) (322587, 18655) (66476, 18655) STARTING UNMASKING ROUND 411 eng cc unigrams 411 precision recall f1-score support au 0.82 0.90 0.86 5000 ca 0.78 0.77 0.77 5000 ch 0.90 0.68 0.77 2688 gb 0.81 0.67 0.73 5000 ie 0.75 0.86 0.80 5000 in 0.88 0.86 0.87 5000 my 0.82 0.83 0.82 5000 ng 0.86 0.95 0.90 5000 nz 0.81 0.65 0.72 5000 ph 0.87 0.87 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.90 0.88 0.89 3788 us 0.80 0.81 0.81 5000 za 0.75 0.86 0.80 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18655) (66476, 18655) (322587, 18627) (66476, 18627) STARTING UNMASKING ROUND 412 eng cc unigrams 412 precision recall f1-score support au 0.82 0.90 0.86 5000 ca 0.78 0.77 0.77 5000 ch 0.90 0.68 0.77 2688 gb 0.81 0.66 0.73 5000 ie 0.75 0.86 0.80 5000 in 0.88 0.85 0.87 5000 my 0.81 0.82 0.82 5000 ng 0.86 0.95 0.90 5000 nz 0.81 0.65 0.72 5000 ph 0.86 0.87 0.87 5000 pk 0.94 0.96 0.95 5000 pt 0.90 0.88 0.89 3788 us 0.80 0.81 0.81 5000 za 0.75 0.86 0.80 5000 avg / total 0.83 0.83 0.83 66476 Reducing feature vectors. (322587, 18627) (66476, 18627) (322587, 18599) (66476, 18599)