feat(data): 盘点数据基建,并提取 2644 个连续不确定性极度遮挡样本#7
Open
JiafuZhang1 wants to merge 4 commits into
Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
UniDriveVLA 核心数据盘点与不确定性 (Uncertainty) 规划表
盘点人:JiafuZhang1 (二作 - 数据与预处理负责人)
环境状态:WSL2 + Conda 基建已通,已成功挂载 nuScenes v1.0-mini 验证集。
初步战果:已通过自动化探针脚本,在 5 个场景中成功定位并提取了 2644 个极度模糊/严重遮挡的优质长尾样本。
一、 数据字段全景盘点表
针对 UniDriveVLA 端到端基座的三个专家分支,当前原生数据集 (nuScenes) 的支持状态如下:
CAM_FRONT,CAM_BACK...bboxes_3dmap_layersVisibility / Occlusionnav_instruction / QAego_trajectorycan_bus二、 重点可疑字段深度剖析 (坑位预警)
在初步探查中,发现以下三个极易导致端到端模型训练崩溃的“坑”,需要团队重点关注:
visibility_token) 是离散分类(如 1: 0-40%),这对训练连续的不确定性分支来说信息量太低。construction(施工) 或temporary_stop属性的 Token,单独标记。三、 场景驱动的数据划分 (Split) 方案 (Draft)
传统的随机洗牌划分无法体现大模型处理“长尾不确定性”的优势。本组数据划分建议采用场景驱动法:
visibility=='1')、雨雾 (Rain/Fog)、夜晚 (Night) 等导致传感器退化的场景。四、 待协同与确认事项 (To 一作)
为了加速第一版 Baseline 的收敛,需要你(一作)确认以下关键信息: