DSWU-BackDoor/2026_OSM

📁 Directory structure

scripts/
 ├── pipeline.py
 ├── objects_extractor.py
 ├── object_version_extractor.py
 ├── gnn_feature_extractor.py
 └── lgbm_feature_extractor.py

test-data/
 ├── changesets.csv
 ├── ovid_labels.tsv
 └── training/labels.tsv

output/
 ├── objects.jsonl
 ├── fetch_prev_queue.csv
 ├── object_versions.jsonl
 ├── processed_changesets.txt
 ├── processed_versions.txt
 ├── nodes.csv
 ├── edges.csv
 ├── labels.csv
 └── lgbm_features.csv
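
The .jsonl outputs are presumably JSON Lines files (one JSON object per line); their schema is not documented here. A minimal sketch for reading such a file, where the helper name iter_jsonl is hypothetical and not part of the repository:

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl(path: Path) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a JSON Lines file.

    Assumption: files such as output/objects.jsonl hold one JSON object
    per line. The field names inside each object are not documented, so
    this helper does not interpret them.
    """
    with path.open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)
```

For example, list(iter_jsonl(Path("output/objects.jsonl"))) would load every record into memory; iterating lazily is preferable for large files.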

🔄 Overall flow

dataset (--dataset)
   ↓
extract changeset IDs
   ↓
objects_extractor
   ↓
objects.jsonl + fetch_prev_queue.csv
   ↓
object_version_extractor (on by default)
   ↓
object_versions.jsonl
  • Changesets and versions that have already been processed are skipped automatically, so the pipeline can be run incrementally.
  • By default, the previous version (prev) of each object is collected as well.
  • With the --no-prev option, previous-version collection is skipped.
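
The skip behavior described above could be sketched as follows. This is not the repository's actual code: it assumes that processed_changesets.txt and processed_versions.txt store one ID per line, and the function names load_processed, pending_ids, and mark_processed are hypothetical.

```python
from pathlib import Path
from typing import Iterable, List, Set


def load_processed(path: Path) -> Set[str]:
    """Return the IDs recorded so far (assumed format: one ID per line)."""
    if not path.exists():
        return set()
    with path.open(encoding="utf-8") as fh:
        return {line.strip() for line in fh if line.strip()}


def pending_ids(all_ids: Iterable[str], processed: Set[str]) -> List[str]:
    """Keep only IDs that have not been processed yet, preserving order."""
    return [item_id for item_id in all_ids if item_id not in processed]


def mark_processed(path: Path, item_id: str) -> None:
    """Append an ID to the log so that a later run skips it."""
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("a", encoding="utf-8") as fh:
        fh.write(f"{item_id}\n")
```

Appending each ID right after it is handled means an interrupted run loses at most the item in progress, which is what makes repeated runs incremental.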

▶ Usage

Basic run:

python scripts/pipeline.py --dataset changesets

Specifying a range:

python scripts/pipeline.py --dataset ovid --start 0 --end 100

Disabling previous-version collection:

python scripts/pipeline.py --dataset training --no-prev

Resetting the output and running again:

python scripts/pipeline.py --dataset changesets --overwrite

⚙ Main options

Option            Description
--dataset         Dataset to use (changesets, ovid, training)
--start / --end   Range of IDs to process
--output-dir      Output directory (default: ./output)
--overwrite       Reset existing results
--no-prev         Disable previous-version collection (collection is on by default)
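
As an illustration only, the options above could be declared with argparse roughly like this. The real pipeline.py may differ: the function name build_parser, the integer type of --start and --end, and making --dataset required are all assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options (a sketch, not the real CLI)."""
    parser = argparse.ArgumentParser(description="Changeset extraction pipeline (sketch)")
    # Dataset names come from the options list; "required" is an assumption.
    parser.add_argument("--dataset", choices=["changesets", "ovid", "training"], required=True)
    # Integer bounds are assumed; whether --end is inclusive is not documented.
    parser.add_argument("--start", type=int, default=None)
    parser.add_argument("--end", type=int, default=None)
    parser.add_argument("--output-dir", default="./output")
    # Both flags default to False, so previous-version collection stays on.
    parser.add_argument("--overwrite", action="store_true")
    parser.add_argument("--no-prev", action="store_true")
    return parser
```

Parsing ["--dataset", "ovid", "--start", "0", "--end", "100"] with this parser yields start=0, end=100, and no_prev=False, matching the range example in the usage section.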
