10.8.1 breaks MTP with external MTP heads on user models

### Platform

Linux/Arch

### Lemonade Version

10.8.1

### GPU / APU Model

AMD Ryzen AI Max+ 395

### Component

llama.cpp

### Bug Description

As of 10.8.1, Lemonade forbids passing `-md` to llama.cpp but provides no way to specify MTP heads in `user_models.json`. As a result, custom models with separate MTP heads, such as Gemma 4, can no longer be loaded with their heads.

### Steps to Reproduce

1. Download the QAT versions of Gemma 4 from Unsloth
2. Download the associated MTP checkpoints
3. Try to load the model as a user model, passing the MTP head to llama.cpp via `-md`
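
For reference, a minimal sketch of the kind of invocation step 3 is aiming for, written against llama.cpp's `llama-server` directly rather than through Lemonade. The file names are placeholders (assumptions, not the actual Unsloth file names), and the snippet only prints the command so it is safe to paste:

```shell
# Placeholder paths: substitute the files downloaded in steps 1 and 2.
MODEL="$HOME/models/gemma-4-qat.gguf"      # hypothetical file name
MTP_HEAD="$HOME/models/gemma-4-mtp.gguf"   # hypothetical file name

# -m selects the main model, -md (--model-draft) the separate head/draft model.
CMD="llama-server -m $MODEL -md $MTP_HEAD"

# Print the command instead of running it; run it by hand once the paths are real.
echo "$CMD"
```

The `-md` argument shown here is the one Lemonade 10.8.1 no longer accepts for user models.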

### Expected vs Actual Behavior

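Expected: Lemonade allows passing `-md` for user models, or at least uses the same format in both `server_models.json` and `user_models.json`, so that there is at least one way to supply an MTP head.

Actual: `-md` is forbidden, and `user_models.json` offers no way to specify the head.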

### Log Output

```shell

```

### Additional Context

_No response_

10.8.1 breaks MTP with external MTP heads on user models #2435