大语言模型智能体驱动的出行日志生成技术

黄菲; 肖智雄; 李旭川; 李国隆; 乐阳

doi:10.13203/j.whugis20250277

摘要: 人类出行数据是洞察人地交互规律、解构城市复杂系统的基础。但真实出行数据获取难度大，且受限于隐私保护，已成为城市研究和地理过程建模等领域的关键数据瓶颈。大语言模型智能体（ Large Language Model Agent,LLM-Agent）已在人类认知决策模拟方面崭露头角，为城市出行数据合成提供了新思路。但面临的核心挑战在于：如何将LLM从通用推理器，变为可准确建模异质性出行行为与城市环境深层互动规律的模拟器。为此，本研究提出一个基于属性-行为自洽与空间吸引力双重约束的出行日志生成框架（Human Mobility Generation Agent,MobiA），通过注入结构化的出行关联知识，驱动Agent生成兼具个性化与地理真实性的时空活动。MobiA从大规模真实数据中挖掘社会属性与时空行为间的异质性规律，构建出行模式知识库。并借助检索增强生成技术将其动态注入Agent的层级决策过程，为个性化行为提供知识约束。其次，通过构建地块吸引力指标，量化空间语义，将抽象出行意图与真实城市环境精准匹配，完成地理空间约束。本研究基于20万出行调查数据进行训练，验证结果表明，MobiA合成数据在复现个性化出行链的真实性与宏观人流分布的地理相似性上，均与观测数据高度吻合。该框架为探索LLM-Agent模拟复杂的人-地交互应用提供了可行的技术路径，其合成结果可为以人为本的智慧城市建设与决策提供关键数据支撑。

Abstract: Objectives Human mobility diaries, represented as sequences of time–place–purpose events, provide a fundamental basis for understanding human–environment interactions and the dynamics of urban systems. Yet high-quality real-world mobility data are increasingly hard to obtain due to privacy constraints, collection costs, and sampling biases, creating a persistent bottleneck for urban studies and geographic process modeling. Large language model agents offer a new direction by simulating human-like decision making, but what does it take for a diary to be mobility-valid, not merely language-plausible? Specifically, how can an LLM generate activity chains that remain consistent with individual attributes and ground abstract intentions in locations that are both semantically appropriate and accessible? Methods To produce synthetic diaries that are both behaviorally plausible and spatially realistic for downstream urban analysis and policy evaluation, we propose LLM-Driven Human Mobility Generation Agent (MobiA), a travel-diary generation framework built on a dual anchoring mechanism i.e., Sociodemographic Grounding and Spatial Attractiveness. First, to enforce attribute–behavior self-consistency, MobiA mines fine-grained mobility groups from large-scale travel surveys and distills pattern–motivation knowledge into a retrievable knowledge base, which is further validated and updated through contextual evaluation. Second, to ensure geographically valid grounding, the city is represented as road-enclosed parcels, and an attractiveness model maps travel intentions to feasible parcels based on semantic supply and accessibility. Conditioned on an agent profile, MobiA performs hierarchical schedule planning and event refinement under retrieved constraints, and then grounds each event to parcel locations via the learned attractiveness scores. Results MobiA is trained and validated using a large-scale Shenzhen WeChat travel survey dataset, which contains 22,228 users, 199,380 trips, and 32,727 locations after preprocessing. (1) We conduct two controlled cohort experiments to test whether MobiA produces attribute-grounded personalization. For women aged 30–35 on weekday mornings, MobiA shifts destinations from a commute-only business-district cluster to increased school-area activity, consistent with family–work routines. For male car owners aged 30–35 with middle-to-high income on weekends, MobiA shifts trips from near-home leisure to longer-distance destinations such as parks, museums, seaside areas, and major shopping centers, demonstrating fine-grained demographic heterogeneity in generated diaries. (2) City-scale realism. We simulate ~50,000 residents and compare 24-hour parcel-level dynamic population against Baidu Maps. As shown in Fig.3, the cosine similarity exceeds 0.8 in >90% of parcels (weekdays and weekends), and >70% of parcels achieve density-fitting >0.5, indicating strong alignment of spatiotemporal population patterns at the parcel scale. Conclusions MobiA generates high-quality, controllable travel diaries with strong behavioral realism and geographic validity. Its dual grounding mechanism links sociodemographic attributes to daily routines and links urban semantics and accessibility to destination choices. We emphasize that individual mobility synthesis must respect both behavioral regularities and spatial feasibility to avoid plausible-butfloating diaries.

大语言模型智能体驱动的出行日志生成技术

Travel Diary Generation Based on Large Language Model Agent