Walle
Created page with "{{see also|Machine learning terms}} ==Introduction== The '''Bilingual Evaluation Understudy''' ('''BLEU''') is an automatic evaluation metric used in natural language processing (NLP) to measure the quality of machine-generated translations. Introduced by IBM Research in 2002, it compares a machine's translations against one or more human-written reference translations. BLEU scores are widely used in the evaluation of machine translation system..."