“听其言，观其行”：大模型价值倾向表征的多重进路

首页 > 过刊浏览>2025年第卷第7期 >169-179

“听其言，观其行”：大模型价值倾向表征的多重进路
DOI:
                        
CSTR:
                        
作者:
                        
作者单位:
作者简介:
通讯作者:
中图分类号:TP18;C931
基金项目:新一代人工智能国家科技重大专项“人工智能社会实验伦理、评估与标准化研究”（2023ZD0121600）；国家自然科学基金青年项目“声誉管理对政府数智化转型公众接受度的影响机制与路径研究”（72204138）。

From discourse to behavior: multiple approaches to representing the value system of large language models

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

随着大模型智能水平的提升，越来越多的观点开始将其视为一类准社会主体，驱动着以智能体为对象的机器行为研究的兴起。其中一个重要问题是大模型正在不同维度上展现出与人类相似的价值倾向性。精准识别大模型的价值倾向性，有助于发展面向智能体的行为科学，并为更加高效、安全的人机交互奠定基础。现有文献已从不同视角开展大模型价值倾向性的研究探索，但尚未形成系统的方法论体系。通过综合借鉴行为科学、多模态信息分析、博弈论等交叉学科研究进展，构建基于“分析尺度—表征逻辑”的类型学分析框架，阐释大模型价值倾向性研究的4个代表性进路，并借助每个路径中的研究实例，详细剖析大模型价值倾向性的表征方法，有助于进一步深化对大模型价值倾向性的系统性认识，为丰富和拓展面向智能体的行为科学提供新的方法论视角。

Abstract:

As large language models (LLMs) continue to advance in intelligence, they are increasingly perceived as quasi-social agents, spurring the emergence of machine behavior studies centered on intelligent systems. A key issue in this domain is the extent to which LLMs exhibit human-like value alignments across various dimensions. Accurate identification of such tendencies is essential for developing a behavioral science of intelligent agents and fostering safer, more effective human-AI interaction. Existing literature has initiated exploratory inquiries from multiple perspectives, yet a systematic methodological framework remains lacking. This paper contributes to the academic discourse on value alignment in LLMs by proposing a typological framework based on the axes of “analytical scale” and “representational logic.” Through the analysis of four representative research approaches, the paper delineates core methodologies for examining value tendencies in LLMs. The findings aim to advance systematic understanding in this area and offer a novel methodological lens for the behavioral science of emerging LLM agents.

参考文献

相似文献

引证文献

引用本文

吕立远,李延昊,王健骁,魏钰明,苏竣.“听其言，观其行”：大模型价值倾向表征的多重进路[J].中国软科学,2025,(7):169-179

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2026-06-03
出版日期:

首页

杂志简介

投稿须知

投稿格式

联系我们

引用本文

分享

相关视频

文章指标

历史

文章二维码