Hostfix: remove not needed params from load_model#2209

Merged

qnixsynapse merged 10 commits intodevfromhostfix/remove_pooling

Jun 12, 2025

Merged

Hostfix: remove not needed params from load_model#2209

qnixsynapse merged 10 commits intodevfromhostfix/remove_pooling

Jun 12, 2025

Conversation

Copy link

Contributor

qnixsynapse commentedJun 12, 2025

Describe Your Changes

Remove unneeded params from load_model and use llama.cpp defaults for most of the params

Fixes Issues

Closes #
Closes #

Self Checklist

Added relevant comments, esp in complex areas
Updated docs (for bug fixes / features)
Created issues for follow-up changes or refactoring needed

qnixsynapse added2 commits

June 12, 2025 10:11

refactor: remove --pooling flag from model loading

32a7bae

The --pooling flag was removed as the mean pooling functionality not needed in chat models. This fixes the regression

feat(local-engine): add ctx_len parameter support

d9ea600

Adds support for the ctx_len parameter by appending --ctx-size with its value. Removed outdated parameter mappings from the kParamsMap to reflect current implementation details and ensure consistency.

github-project-automationbot added this toJan

Jun 12, 2025

qnixsynapse added2 commits

June 12, 2025 11:29

feat: add conditional model parameters based on path

4a02ef5

When the model path contains both "jan" and "nano" (case-insensitive), automatically addspeculative decoding parameters to adjust generation behavior. This improvesflexibility by enabling environment-specific configurations without manualparameter tuning. Also includes necessary headers for string manipulation andfixes whitespace in ctx_len handling.

chore: remove redundant comment

41023d3

The comment was redundant as the code's purpose is clear without it, improving readability.

qnixsynapse requested review fromMinh141120,david-menloai andlouis-jan

June 12, 2025 06:15

qnixsynapseenabled auto-merge (squash)

June 12, 2025 06:59

qnixsynapseand others added6 commits

June 12, 2025 12:47

feat: add new parameters and flags to local engine configuration

87ad9bc

This commit introduces new configuration parameters and their corresponding command-line flags for the local engine. The changes include:- Adding "flash_attn" to ignored parameters- Mapping UI parameters to CLI flags (e.g., cpu_threads → --threads)- Expanding support for various model configuration optionsThese additions enhance the flexibility of the local engine by enabling fine-grained control over performance and behavior through both UI and CLI interfaces.

feat: add support for 'qwen' in parameter conversion

8725f38

The condition was updated to include 'qwen' in the check for triggering specific parameters('--temp', '--top-p', etc.), aligning it with the existing 'jan' and 'nano' validation logic. This allowsthe same parameter configuration to apply to 'qwen' models as well as the original keywords.

fix: remove deprecated parameters and adjust ignored list

7ae8a15

Removed deprecated parameters such as "dynatemp_exponent" and "ctx_len" handling logic,which were no longer needed. Added "flash_attn" back to the ignored parameters list.Cleaned up the parameter conversion logic by removing conditional blocks forspecific model optimizations that are no longer required.