Hi PixelRAG team,
First of all, congratulations on this amazing project! Implementing pixel-native visual RAG to preserve document layouts, tables, and charts is a massive leap forward.
We have been working on a similar challenge in document ingestion pipelines in our open-source project, Motor 15 (Geometric Audit):
https://github.com/joinsnipe/motor15-geometric-audit
We use mathematical and spectral validation metrics (such as the Mantel Test, Wasserstein-1 distance on MST, and Heat Kernel Signature spectral correlation) to audit and certify the geometric and topological consistency between different embedding projection spaces.
Since PixelRAG retrieves documents visually, we believe it would be highly valuable to build a benchmark/validation tool that tests the metric isomorphism between:
- The original text-based embedding space.
- PixelRAG's visual/pixel-based embedding space.
This would give quantitative, statistically testable evidence on whether shifting from a text to a pixel-native representation preserves the geometric structure of semantic relationships (pairwise distances and neighborhood structure) or introduces metric distortions in the latent space.
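To make the proposal concrete, here is a minimal sketch of the simplest of these checks, a Mantel test between two embedding spaces, in plain Python. It is not Motor 15's actual API and it uses nothing from PixelRAG: the function names (`mantel_test`, `pairwise_distances`, `upper_triangle`, `pearson`) are hypothetical, and Euclidean distance with Pearson correlation is an assumed choice. Both inputs are assumed to be embeddings of the same documents, in the same order.

```python
import math
import random


def pairwise_distances(points):
    """Return the full symmetric matrix of Euclidean distances."""
    n = len(points)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(points[i], points[j])
            dist[i][j] = dist[j][i] = d
    return dist


def upper_triangle(dist, order):
    """Flatten the upper triangle after relabelling rows and columns by `order`."""
    n = len(order)
    return [dist[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mantel_test(emb_a, emb_b, permutations=999, seed=0):
    """Correlate the two distance matrices and estimate a one-sided
    permutation p-value by shuffling the document labels of emb_b."""
    rng = random.Random(seed)
    dist_a = pairwise_distances(emb_a)
    dist_b = pairwise_distances(emb_b)
    identity = list(range(len(emb_a)))
    flat_a = upper_triangle(dist_a, identity)
    observed = pearson(flat_a, upper_triangle(dist_b, identity))
    hits = 0
    for _ in range(permutations):
        order = identity[:]
        rng.shuffle(order)
        if pearson(flat_a, upper_triangle(dist_b, order)) >= observed:
            hits += 1
    # Add-one correction so the p-value is never exactly zero.
    p_value = (hits + 1) / (permutations + 1)
    return observed, p_value
```

Called as `r, p = mantel_test(text_embeddings, pixel_embeddings)`, a high `r` with a small `p` would suggest that pairwise distances are largely preserved between the two spaces. Note that this measures correlation of distance structure, not strict isometry, and the two spaces do not need to share a dimensionality.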
Would you be open to a discussion or a contribution demonstrating how we can apply these geometric audit methods to PixelRAG's embedding spaces?
Best regards,
Rubén Abella / TRACE Team