Similarity Papers Revisited
Founder
2024-03-06 13:53:57

Towards a Unified Multi-Dimensional Evaluator for Text Generation

The paper evaluates the quality of generated text along multiple dimensions, such as consistency and fluency.

The authors build the training set themselves, with 30K pseudo-labeled samples per dimension:

we first design specific rules for several commonly evaluated dimensions to construct pseudo data, and then combine them to train the evaluator.
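The quote above does not spell out the rules, so the following is only a minimal sketch of what rule-based pseudo data could look like. It assumes a single hypothetical rule for a fluency-like dimension: the original sentence is kept as a positive sample, and a copy with two adjacent tokens swapped becomes a negative sample. The function name `make_pseudo_pairs` and the 1/0 labels are illustrative assumptions, not the paper's actual procedure.

```python
import random


def make_pseudo_pairs(sentences, seed=0):
    """Build (text, label) pairs: originals get 1, perturbed copies get 0.

    Hypothetical rule for illustration only: swap one pair of adjacent,
    non-identical tokens to produce a disfluent negative sample.
    """
    rng = random.Random(seed)
    samples = []
    for sent in sentences:
        tokens = sent.split()
        samples.append((sent, 1))
        # Positions where swapping neighbours actually changes the text.
        positions = [i for i in range(len(tokens) - 1) if tokens[i] != tokens[i + 1]]
        if not positions:
            continue  # too short or all tokens identical: no negative sample
        i = rng.choice(positions)
        tokens[i], tokens[i + 1] = tokens[i + 1], tokens[i]
        samples.append((" ".join(tokens), 0))
    return samples


pairs = make_pseudo_pairs(["the cat sat on the mat", "hello"])
for text, label in pairs:
    print(label, text)
```

Samples built this way for each dimension could then be pooled to train a single evaluator, which is the "combine them" step in the quote.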

Task formats: summarization and dialogue.

Experimental validation: the baselines compared include BLEU, METEOR, ROUGE, BERTScore, and others.

Human-annotated data: to verify that the proposed evaluator is qualified, we need to calculate correlations with human scores on each benchmark.
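As a rough illustration of this check, the sketch below computes a Spearman rank correlation between metric scores and human scores in plain Python. The note does not say which correlation coefficient is used, so Spearman is an assumption here, and the score lists are made-up toy values rather than results from any benchmark.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; raises ValueError if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman correlation is the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy values for illustration only (not taken from any benchmark).
metric_scores = [0.91, 0.40, 0.75, 0.10]
human_scores = [5, 2, 4, 1]
rho = spearman(metric_scores, human_scores)
print(rho)
```

A value near 1 means the metric ranks outputs in nearly the same order as the human annotators, which is the sense in which an evaluator would count as qualified.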

Train the evaluator for 1-3 epochs. Supervised method.

BARTScore: Evaluating Generated Text as Text Generation

Conditional text generation: for example, in machine translation the goal is to generate a hypothesis (h = h_1, ..., h_m) based on a given source text (s = s_1, ..., s_n).
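The note does not write out a formula for this setup. Assuming a standard left-to-right sequence-to-sequence model, the probability of a hypothesis given the source factorizes token by token, and its logarithm is a sum of per-token log-probabilities:

```latex
p(h \mid s) = \prod_{t=1}^{m} p(h_t \mid h_{<t}, s),
\qquad
\log p(h \mid s) = \sum_{t=1}^{m} \log p(h_t \mid h_{<t}, s)
```

Here h_{<t} denotes the tokens h_1, ..., h_{t-1} generated before position t.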

require human judgments to train (i.e., supervised me
