You can download the Dutch-Turkish (NL-TR) word-level language identification dataset here:

dataset-emnlp2013.zip

If you use this data, please cite:

D. Nguyen, A.S. Dogruöz: Word Level Language Identification in Online Multilingual Communication. EMNLP 2013.

For questions, contact Dong Nguyen.