相似度论文再回顾
创始人
2024-03-06 13:53:57

Towards a Unified Multi-Dimensional Evaluator for Text Generation

多个维度出发评价生成文本的质量,如一致性、流畅度等等。

每个维度的伪标注样本数量为30K,作者构建的数据集:

we first design specific rules for several commonly evaluated dimensions to construct pseudo data, and then combine them to train the evaluator.

任务形式:summary和dialogue。

实验验证:对比model有BLEU、METHOR、ROUGE、Bertscore....

人工标注的数据:TO verfify the proposed evaluator is qualifited, we need to calculated correlations with human scores in each benchamark.

Train the evaluator for 1-3 epochs. _Supervised method.

BARTSCORE: Evaluating Generated Text as Text Generation

Conditional text generation: for example,machine translation, so the goal is to generate a hypothesis (h = h1, · · · , hm) based on a given source text (s = s1, · · · , sn)

require human judgments to train (i.e., supervised me

相关内容

热门资讯

中证A500ETF摩根(560... 8月22日,截止午间收盘,中证A500ETF摩根(560530)涨1.19%,报1.106元,成交额...
A500ETF易方达(1593... 8月22日,截止午间收盘,A500ETF易方达(159361)涨1.28%,报1.104元,成交额1...
何小鹏斥资约2.5亿港元增持小... 每经记者|孙磊    每经编辑|裴健如 8月21日晚间,小鹏汽车发布公告称,公司联...
中证500ETF基金(1593... 8月22日,截止午间收盘,中证500ETF基金(159337)涨0.94%,报1.509元,成交额2...
中证A500ETF华安(159... 8月22日,截止午间收盘,中证A500ETF华安(159359)涨1.15%,报1.139元,成交额...