Refactor Navigation and KGE Pipeline: Consistent Entity Naming, Enhanced Evaluation, and Robust Config Management by HernandezEduin · Pull Request #34 · HalcyonSolutions/MultiHopKG

HernandezEduin · 2025-09-15T07:11:12Z

Summary of Recent Changes

This pull request brings major refactors, feature enhancements, configuration changes, and bug fixes, particularly around navigation (supervised), KGE, and configuration management.

Highlights

Consistent Data Schema and Naming
- Standardized the codebase to use source_ent/Source-Entity throughout, replacing query_ent/Query-Entity and the use of query_rel (relation) where possible. This impacts all data loading, supervision, and environment interfaces.
- Function signatures, docstrings, and data processing logic updated for consistency and clarity across supervised and RL navigation.
Supervised Path Learning
- Cleanup and removal of unnecessary variables and lines.
- Added saving and loading functions for supervised navigation.
- Config updates and added support for optional validation and model checkpointing.
- Training, testing, and validation can now be performed independently.
- Rollouts and noise injection added to training process.
- Improved tqdm and metrics printing for easier monitoring.
- Updated configs and sweeps for best performance (KinshipHinton, etc.).
- Configurations added for reproducibility.
- Fixed typo in filename: nav_superviced_training.py → nav_supervised_training.py.
Knowledge Graph Embedding (KGE)
- Moved KGE loading logic from training scripts to utility modules.
- Log metrics and model saving functions refactored into utilities.
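- Entity-embedding calculations (calculate_entity_range, calculate_entity_centroid, get_entity_embeddings_from_indices) and cleanup functions (clean_up_checkpoints, clean_up_folder) moved to utils/convenience and utils/cleaning.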
- Best model loading for evaluation after training.
- Added option for full graph training (navigability focus).
- Improved config dump formatting for clarity.
Configuration Management
- Major reorganization: RL navigation and KGE configs/sweeps moved to dedicated directories.
- Timestamps corrected to the YYYY/mm/dd format.
- The use_ann_reward CLI argument now uses a store action instead of a string value.
- Improved data splitting: now recognizes both dev and test splits if present, with fallback to random splits.
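
The split fallback mentioned in the last item could look roughly like the sketch below. The function name `resolve_splits`, the `holdout_frac` default, and the in-memory triple lists are assumptions for illustration; the actual logic lives in multihopkg/data_utils.py and may differ.

```python
import random
from typing import Dict, List, Optional, Sequence, Tuple

Triple = Tuple[str, str, str]


def resolve_splits(
    train: Sequence[Triple],
    dev: Optional[Sequence[Triple]] = None,
    test: Optional[Sequence[Triple]] = None,
    holdout_frac: float = 0.1,  # hypothetical default, not from the repository
    seed: int = 0,
) -> Dict[str, List[Triple]]:
    """Keep provided dev/test splits; carve any missing one out of train at random."""
    remaining = list(train)
    splits: Dict[str, List[Triple]] = {}
    provided = {"dev": dev, "test": test}
    missing = [name for name, part in provided.items() if part is None]

    if missing:
        # Shuffle a copy only when a random fallback is actually needed.
        random.Random(seed).shuffle(remaining)
        n_hold = max(1, int(len(remaining) * holdout_frac))

    for name, part in provided.items():
        if part is not None:
            splits[name] = list(part)
        else:
            # Random fallback: take the next slice of the shuffled training data.
            splits[name] = remaining[:n_hold]
            remaining = remaining[n_hold:]

    splits["train"] = remaining
    return splits
```

When both splits are supplied, the training data is returned untouched and in its original order.
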
Reinforcement Learning and Environment
- Added rollouts to supervised navigation and noise to agent position.
- Added option for extrinsic reward calculation using FAISS or epsilon threshold.
- Refactored rollout logic: now returns richer tensors, loss calculation done separately.
- Environment now recycles answer ID tensors for efficiency.
- Added and clarified metric variable names for disambiguation.
- Improved initial state/embedding logic and batch handling for environment resets and RL episodes.
Metrics and Evaluation
- Enhanced evaluation routines: now computes Hits@1, Hits@3, Hits@5, Hits@10, Hits@20, and Mean Reciprocal Rank (MRR) with improved rank calculation.
- Logging and output updated to match new metrics.
- Improved nearest-neighbor search and reshaping for batch evaluation.
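
For reference, the metrics listed above follow the standard definitions: Hits@k is the fraction of queries whose correct answer is ranked at position k or better, and MRR is the mean of the reciprocal ranks. A minimal sketch, assuming 1-based ranks have already been computed (the function name `ranking_metrics` is hypothetical and not taken from the repository):

```python
from typing import Dict, Iterable, Sequence


def ranking_metrics(
    ranks: Sequence[int], ks: Iterable[int] = (1, 3, 5, 10, 20)
) -> Dict[str, float]:
    """Compute Hits@k, MRR and mean rank from 1-based ranks of the correct answers."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    n = len(ranks)
    # Hits@k: share of queries whose answer is ranked within the top k.
    metrics = {f"hits@{k}": sum(r <= k for r in ranks) / n for k in ks}
    # MRR: average of 1/rank; MR: plain average rank.
    metrics["mrr"] = sum(1.0 / r for r in ranks) / n
    metrics["mr"] = sum(ranks) / n
    return metrics
```
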
General Refactoring
- Refactored and consolidated code from exogenous/sun_models.py into utils modules for convenience, saving, cleaning, and metrics.
- Removed obsolete triplet-saving and debugging code.
- Improved variable naming, documentation, bugfixes for index/batch handling, and code maintainability throughout.

This PR introduces significant backend improvements, enhances reproducibility, and streamlines configuration and training workflows. It lays groundwork for future development and improves maintainability for navigation and embedding tasks.

Major affected files

README.md
mlm_training.py
nav_supervised_training.py
nav_training.py
multihopkg/data_utils.py
multihopkg/rl/graph_search/pn.py
multihopkg/utils_debug/dump_evals.py
multihopkg/vector_search.py

For the full commit list and details, see the commit history in the source repository.

- Replaced extracting answer_tensor and instead using the one already calculated by the environment.
- Additionally returning tensors of shape (batch_size, num_rollouts, steps) from Rollout instead of (batch_size, steps).
- Calculating Reinforce Loss in a Separate Function.
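
To illustrate the shapes involved, here is a hedged sketch of a REINFORCE loss computed separately from the rollout, over arrays of shape (batch_size, num_rollouts, steps). NumPy is used only to show the arithmetic; the repository works with PyTorch tensors, and the function names, the discount factor `gamma`, and the absence of a baseline are assumptions.

```python
import numpy as np


def discounted_returns(rewards: np.ndarray, gamma: float = 0.99) -> np.ndarray:
    """Return-to-go along the last (steps) axis; input shape (batch, rollouts, steps)."""
    returns = np.zeros_like(rewards, dtype=float)
    running = np.zeros(rewards.shape[:-1], dtype=float)
    # Walk backwards in time so each step accumulates the discounted future reward.
    for t in reversed(range(rewards.shape[-1])):
        running = rewards[..., t] + gamma * running
        returns[..., t] = running
    return returns


def reinforce_loss(log_probs: np.ndarray, rewards: np.ndarray, gamma: float = 0.99) -> float:
    """Negative return-weighted log-likelihood, averaged over batch and rollouts."""
    returns = discounted_returns(rewards, gamma)
    per_rollout = (log_probs * returns).sum(axis=-1)  # (batch, rollouts)
    return float(-per_rollout.mean())
```
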

Added an optional variable in ITLGraphEnvironment for choosing how the extrinsic reward is computed: either use FAISS to find the closest entity and reward the agent if it is the answer, or use an epsilon distance threshold around the answer embedding. FAISS should be the more correct approach, but it slows down the computation.
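
The difference between the two reward modes can be sketched as follows. This is an illustration only: a brute-force NumPy search stands in for FAISS, and the function name `extrinsic_reward`, the `use_ann` flag, and the default `epsilon` are assumptions rather than the ITLGraphEnvironment interface.

```python
import numpy as np


def extrinsic_reward(
    position: np.ndarray,      # (dim,) current agent position
    answer_id: int,
    entity_emb: np.ndarray,    # (num_entities, dim)
    use_ann: bool = False,
    epsilon: float = 0.1,      # hypothetical threshold
) -> float:
    """Reward 1.0 when the agent is judged to have reached the answer, else 0.0."""
    if use_ann:
        # Nearest-entity mode: brute-force search here; a FAISS index would do this lookup.
        dists = np.linalg.norm(entity_emb - position, axis=1)
        return float(int(np.argmin(dists)) == answer_id)
    # Threshold mode: only the distance to the answer embedding is checked.
    return float(np.linalg.norm(position - entity_emb[answer_id]) <= epsilon)
```

The two modes can disagree: a position may be closest to the answer entity and still fall outside the epsilon ball, in which case only the nearest-entity mode gives a reward.
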

Moving the functions from exogenous/sun_models to utils/convenience for a better location:
- calculate_entity_range
- calculate_entity_centroid
- get_entity_embeddings_from_indices
Modifying affected files accordingly:
- mlm_training.py
- nav_supervised_training.py
- nav_training.py
- multihopkg/rl/graph_search/pn.py

Moving and renaming the following functions from exogenous/sun_models.py to utils/saving.py:
- save_configs -> save_train_configs
- save_model -> save_kge_model
- update_best_model -> update_best_kge_model
Updated the following affected files:
- kge_train.py
- nav_supervised_training.py

HernandezEduin added 30 commits August 25, 2025 16:10

Seeds: Ensuring Transformers Receive Seeds

4f8aefb

Data Utils - SplitLabel: Correction for the Test/Dev Split

2be1b1f

Data Utils: Source Entity and Cleaning

f6aeb26

Removing query_relation (as it is unused), saving of the triplets from QA (as the benchmark already works), and renaming Query-Entity to Source-Entity for a clearer understanding of the variable.

Query-Entity -> Source-Entity & Removing Query-Relation

4a4b94c

Modified all the necessary files that are affected by the renaming to Source-Entity and the removal of Query-Relation. Additional correction for Single Hop Supervised.

File Name Correction: Superviced -> Supervised

2318bd5

Update README.md

Found Cache Issue: Temp Solution

057e2e9

Forcing found cache to be the cached_metadata_path if the file exists. The function `shift_through_cache_data` returns None regardless of the situation, causing a forced recompute for each new instance.

VectorSearch for Rollouts

85e63ec

Modified vector search to be able to take 3D tensors where the 2nd Dim is the rollouts.
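
One common way to support a rollout dimension is to flatten the leading dimensions, search, and reshape the results back. The sketch below shows that pattern with a brute-force NumPy search standing in for the real index; the function name `nearest_entities` is hypothetical and multihopkg/vector_search.py may be organized differently.

```python
import numpy as np


def nearest_entities(queries: np.ndarray, entity_emb: np.ndarray):
    """Nearest entity per query.

    queries: (batch, dim) or (batch, num_rollouts, dim)
    entity_emb: (num_entities, dim)
    Returns (ids, dists), each shaped like queries without the last axis.
    """
    lead_shape = queries.shape[:-1]
    flat = queries.reshape(-1, queries.shape[-1])  # (batch * num_rollouts, dim)
    # Pairwise L2 distances between every flattened query and every entity.
    dists = np.linalg.norm(flat[:, None, :] - entity_emb[None, :, :], axis=-1)
    ids = dists.argmin(axis=1)
    best = dists[np.arange(flat.shape[0]), ids]
    # Restore the (batch, num_rollouts) layout expected by the caller.
    return ids.reshape(lead_shape), best.reshape(lead_shape)
```
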

Evaluation on Rollouts

6a010a6

Modified code to be able to evaluate the performance in a similar fashion to literature: ranks the rollouts according to their scores (distance to closest entity in this case) Correction

Test NAV: Path-Ranking based on Log Probs

452e6b9

- Correcting the dimension in test_nav due to num_rollouts
- Using Path Log Prob to do the ranking of the Rollouts
- Adding back Mean Rank (MR) to have a better idea of the performance
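
A hedged sketch of ranking rollouts by path log probability for a single query: rollouts are sorted from most to least likely, and the rank of the answer is its position among the distinct entities reached. The function name `answer_rank`, the de-duplication of entities, and the convention for a missed answer are assumptions; test_nav may handle these cases differently.

```python
from typing import Sequence


def answer_rank(
    final_entities: Sequence[int],
    path_log_probs: Sequence[float],
    answer_id: int,
) -> int:
    """1-based rank of answer_id among distinct end entities, ordered by path log prob.

    If no rollout reaches the answer, returns (number of distinct entities + 1);
    this miss convention is an assumption.
    """
    # Most likely rollouts first.
    order = sorted(
        range(len(final_entities)), key=lambda i: path_log_probs[i], reverse=True
    )
    seen = []
    for i in order:
        entity = final_entities[i]
        if entity not in seen:
            seen.append(entity)
        if entity == answer_id:
            return len(seen)
    return len(seen) + 1
```

The resulting ranks can then be fed into the Hits@k, MRR, and MR computations.
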

Arguments: Test Rollouts

cd1cd07

Added an additional argument which controls the number of rollouts during the EVALUATION phase. This is necessary for the benchmark metric comparison. Keep at 100.

Adding Comments and Changing Variable Name

3cce361

- Adding comments on the expected shape of variables
- Changing Metric Variable Name from Distance to Ans Distance for disambiguation with the distance provided by FAISS

Recycling Answer ID Tensor from Environment

968eb28

NAV Supervised: Adding Rollouts and Noise to Training

2dac994

- Added Rollouts to the training during Supervised
- Added Noise to the current position of the Agent
- Modified the Query Rel for the Adapter to be the average relations emb of the supervised path

Supervised Path Learning: Configs and Sweeps

e598545

The configurations (best) and sweeps for the supervised path learning are added and moved to ./configs/supervised_path_learning/ for disambiguation.

KGE Sweeps and Configs

affb165

Moving KGE training related configs and sweeps into ./configs/embedding_training.

RL Navigation Sweeps

5454832

Moving RL Navigation Configs and Sweeps to ./configs/navigation/

Alpha: Using Store Action instead of Str for use_ann_reward

19a32c5
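
The motivation for this kind of change is usually that a string-typed flag such as `--use_ann_reward False` yields the non-empty string "False", which is truthy. A minimal sketch with argparse, assuming the store action is `store_true` (the commit title only says "Store Action") and using a hypothetical `build_parser` helper:

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser with a boolean flag; defaults to False unless the flag is present."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--use_ann_reward",
        action="store_true",  # assumption: could equally be store_false in the repository
        help="Reward via nearest-entity search instead of an epsilon threshold.",
    )
    return parser
```

With this form, the flag is passed without a value: `--use_ann_reward` enables it, and omitting it leaves it disabled.
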

Correcting Timestamp to YYYY/mm/dd

41454a8

Configs/Supervised: Updating models config to match paper

29051e8

Sun Model: Improving Config Dump

7208e36

Adding indent to make it more presentable
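
If the config is dumped as JSON (an assumption; the commit does not state the format), indentation is a one-argument change. The helper name `format_config` is hypothetical.

```python
import json


def format_config(config: dict) -> str:
    """Serialize a config dict with indentation so the dump is readable."""
    # indent=4 puts each key on its own line; sort_keys keeps the output stable.
    return json.dumps(config, indent=4, sort_keys=True)
```
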

KGE Train: Adding Full Graph Training

553e5f7

Adding a parameter to allow training on the full graph (assuming KGE completion is no longer the task and the task is instead navigability).

KGE Train: Loading Best Model for Evaluation after Training

eba69cb

Supervised Path Learning: Adding Best Configs for KinshipHinton

83161a7

KGE: Adding best Config for KinshipHinton KGE training

5c0a6f6

Supervised Path Learning: Modifying tqdm and metrics printing

f354e30

NAV SV: Saving Best Model During Training

5ca0c0c

ALPHA: Adding optional parameter to perform training, testing, and validation

500132f

NAV SV: Do Train, Test, and Valid independently

45a3636

HernandezEduin added 10 commits September 10, 2025 13:25

Alpha: Adding Checkpoint Path to load a pretrained model

1672aba

NAV SV: Valid Option in Train and Loading Pretrained Model

fb0e21d

Before the beginning of training, a pretrained model can be loaded if `--checkpoint_path` is a valid path and contains the dir with the model. Additionally, validation is now optional in the training.
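
The check described here might be sketched as below. The helper name `resolve_checkpoint` and the sub-directory name `model` are hypothetical; the actual directory layout expected by nav_supervised_training.py is not stated in this PR.

```python
import os
from typing import Optional


def resolve_checkpoint(
    checkpoint_path: Optional[str],
    model_dirname: str = "model",  # hypothetical directory name
) -> Optional[str]:
    """Return the model directory inside checkpoint_path, or None to train from scratch."""
    if not checkpoint_path or not os.path.isdir(checkpoint_path):
        return None
    model_dir = os.path.join(checkpoint_path, model_dirname)
    # Only treat the checkpoint as usable when the model directory actually exists.
    return model_dir if os.path.isdir(model_dir) else None
```
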

Configs: Updating Nav SV Configs

73c85ec

Moving KG Cleanup from sun_models to utils/cleaning

91bf7ac

Moving the following functions from exogenous/sun_models to utils/cleaning:
- clean_up_checkpoints
- clean_up_folder
Affected files:
- kge_train.py

Moving KGE Log Metrics from KGE Train to utils/metrics

34849d3

Moving KGE loading from KGE train to utils/saving

c7900de

NAV SV: Added Saving and Loading functions

ae7e5d6

NAV SV: Cleanup

d56a902

Removing unnecessary variables and lines in supervised path learning algorithm

HernandezEduin wants to merge 40 commits into HalcyonSolutions:master from HernandezEduin:master
