Following are the metrics we need to implement based on [jury](https://github.com/obss/jury#available-metrics) - [x] Bleu score - [ ] sequence evaluation (NER) - [x] ROUGE - [x] Meteor - [ ] Perplexity All these metrics could be derived through `evalem.metrics._base.JuryBasedMetric` (see `evalem.metrics.basics` module for some general implementation) --- cc: @muthukumaranR @xhagrg @code-geek