ROUGE, or Recall-Oriented Understudy for Gisting Evaluation, is a set of metrics and a software package used for evaluating automatic summarization and machine translation software in natural language processing. The metrics compare an automatically produced summary or translation against a reference or a set of references (human-produced) summary or translation.
Metrics
The following five evaluation metrics are available.
ROUGE can be downloaded from berouge download link.
References
ROUGE (metric) Wikipedia(Text) CC BY-SA