Prefer visqol-python 3.5.0 and ensure lattice speech support #16

Merged
nschimme merged 3 commits into master from prefer-visqol-python-v3.5.0-14811030729734419766 on May 26, 2026
@nschimme

Owner

This PR updates the FAAC Benchmark Suite to use visqol-python as the preferred backend for perceptual quality (MOS) computation.

Key changes:

  1. Validation: Confirmed that visqol-python 3.5.0 produces MOS scores consistent with the C++ visqol binary on the TCD-VOIP speech dataset, resolving previous concerns about inflated scores.
  2. Dependency Update: Updated requirements.txt to visqol-python[accel,lattice]>=3.5.0 to ensure optimized execution and the presence of the deep-lattice speech mapper.
  3. Backend Prioritization: Modified phase2_mos.py and run_benchmark.py to move visqol-python to the top of the priority list for "auto" mode.
  4. Speech Model Support: Added logic to explicitly use the lattice_tcditugenmeetpackhref_ls2_nl60_lr12_bs2048_learn.005_ep2400_train1_7_raw.tflite model for speech mode when available, ensuring parity with the C++ implementation.
  5. GitHub Action Update: Changed the default backend for the GitHub Action from docker to visqol-python for faster and simpler execution.
  6. Docker Maintenance: Updated Dockerfile.visqol to include the new dependencies and fixed the PYTHONPATH for compatibility with the Ubuntu 22.04 base image.
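
The backend prioritization in item 3 can be illustrated with a minimal sketch. This is not the code in phase2_mos.py or run_benchmark.py: the function names, the `visqol-binary` backend label, the fallback order after `visqol-python`, and the assumption that the package imports as `visqol` are all hypothetical, chosen only to show how an "auto" mode might walk a priority list.

```python
import importlib.util
import shutil
from typing import Callable, Mapping, Optional, Sequence

# Assumed priority for "auto" mode: the Python package first, then fallbacks.
# Only "visqol-python" and "docker" are named in the PR; "visqol-binary" and
# the order of the fallbacks are illustrative.
AUTO_PRIORITY: Sequence[str] = ("visqol-python", "visqol-binary", "docker")


def _has_module(name: str) -> bool:
    """Return True if a top-level module with this name can be imported."""
    return importlib.util.find_spec(name) is not None


def default_probes() -> Mapping[str, Callable[[], bool]]:
    """Availability checks per backend (import and executable names are assumptions)."""
    return {
        "visqol-python": lambda: _has_module("visqol"),
        "visqol-binary": lambda: shutil.which("visqol") is not None,
        "docker": lambda: shutil.which("docker") is not None,
    }


def select_backend(
    requested: str = "auto",
    probes: Optional[Mapping[str, Callable[[], bool]]] = None,
) -> str:
    """Honor an explicit backend; in "auto" mode pick the first available one."""
    if requested != "auto":
        return requested
    probes = default_probes() if probes is None else probes
    for name in AUTO_PRIORITY:
        probe = probes.get(name)
        if probe is not None and probe():
            return name
    raise RuntimeError("no ViSQOL backend available")
```

Passing the probes in as a mapping keeps the selection logic testable without installing any backend, for example `select_backend("auto", {"docker": lambda: True})`.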

PR created automatically by Jules for task 14811030729734419766 started by @nschimme

- Validated that visqol-python issue #1 is fixed (MOS consistency restored).
- Updated requirements.txt to use visqol-python[accel,lattice]>=3.5.0.
- Updated phase2_mos.py to prioritize visqol-python and use the deep-lattice speech model.
- Updated action.yml and run_benchmark.py to make visqol-python the default backend.
- Fixed Dockerfile.visqol to include lattice support and use correct PYTHONPATH.
- Cleaned up visqol-python initialization logic for speech/audio modes.
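
One way to express "use the lattice speech model when available" is a small lookup that returns the model path only in speech mode and only if the file exists. The sketch below is an assumption about the shape of that logic, not the code in phase2_mos.py: `resolve_speech_model` and the `model_dir` parameter are hypothetical, and how the resulting path is handed to visqol-python is deliberately left out.

```python
from pathlib import Path
from typing import Optional

# File name taken from the PR description.
LATTICE_SPEECH_MODEL = (
    "lattice_tcditugenmeetpackhref_ls2_nl60_lr12_bs2048_"
    "learn.005_ep2400_train1_7_raw.tflite"
)


def resolve_speech_model(model_dir: Path, mode: str) -> Optional[Path]:
    """Return the lattice model path for speech mode, or None.

    None means "let the backend use its default mapping": either the
    mode is not "speech" or the model file is not present in model_dir.
    """
    if mode != "speech":
        return None
    candidate = Path(model_dir) / LATTICE_SPEECH_MODEL
    return candidate if candidate.is_file() else None
```

Returning `None` instead of raising keeps audio mode and installs without the lattice extra working unchanged.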
@google-labs-jules

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.


For security, I will only act on instructions from the user who triggered this task.

@sourcery-ai sourcery-ai Bot left a comment

Sorry @nschimme, you have reached your weekly rate limit of 500000 diff characters.

Please try again later or upgrade to continue using Sourcery.

nschimme added 2 commits May 26, 2026 16:49
- Validated visqol-python 3.5.0 speech MOS consistency against C++ binary.
- Updated requirements.txt and Dockerfile.visqol to use visqol-python[accel,lattice]>=3.5.0.
- Prioritized visqol-python in phase2_mos.py and run_benchmark.py auto-selection.
- Added explicit support for the lattice speech model in phase2_mos.py.
- Removed dead batching code from phase2_mos.py.
- Updated README.md and action.yml to reflect new default backend.
- Fixed Dockerfile.visqol PYTHONPATH for Ubuntu 22.04 compatibility.
…ching

- Validated visqol-python 3.5.0 speech MOS consistency against C++ binary.
- Updated requirements.txt to use visqol-python[accel,lattice]>=3.5.0.
- Prioritized visqol-python in phase2_mos.py and run_benchmark.py auto-selection.
- Implemented optimized batch processing in phase2_mos.py using measure_batch(parallel=True).
- Added explicit support for the lattice speech model in phase2_mos.py.
- Removed dead code and updated README.md / action.yml for new defaults.
- Enhanced CI to test Docker build on PRs (build only).
- Reverted Dockerfile.visqol to use visqol-py with pinned dependencies.
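
Because requirements.txt pins `visqol-python[accel,lattice]>=3.5.0`, a benchmark script could also check the installed version at runtime before trusting the MOS scores. The following is a minimal sketch using only the standard library; the helper names are hypothetical, and the simplified version parsing (leading numeric components only) is an assumption rather than full PEP 440 handling.

```python
from importlib import metadata
from typing import Optional, Tuple

MIN_VERSION = "3.5.0"  # floor taken from requirements.txt in this PR


def parse_release(version: str) -> Tuple[int, ...]:
    """Extract leading numeric components, e.g. "3.5.1rc1" -> (3, 5, 1)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_floor(version: str, floor: str = MIN_VERSION) -> bool:
    """Compare release tuples, padding the shorter one with zeros."""
    have, want = parse_release(version), parse_release(floor)
    width = max(len(have), len(want))
    have += (0,) * (width - len(have))
    want += (0,) * (width - len(want))
    return have >= want


def installed_version(dist: str = "visqol-python") -> Optional[str]:
    """Return the installed distribution version, or None if not installed."""
    try:
        return metadata.version(dist)
    except metadata.PackageNotFoundError:
        return None
```

A caller might fall back to another backend when `installed_version()` is `None` or `meets_floor(...)` is false.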
@nschimme nschimme force-pushed the prefer-visqol-python-v3.5.0-14811030729734419766 branch from 60fd175 to a42c51a Compare May 26, 2026 17:25
@nschimme nschimme merged commit de50a99 into master May 26, 2026
5 checks passed
@nschimme nschimme deleted the prefer-visqol-python-v3.5.0-14811030729734419766 branch May 26, 2026 17:27