Enable C++ only text generation #3260
base: main
Conversation
Pull Request Overview
This PR enables C++-only text generation endpoints by conditionally disabling Python-dependent code and replacing references to the old text processor with a new template processor. Key changes include:
- Adding preprocessor guards (#if (PYTHON_DISABLE == 0)) around Python interpreter initialization and related includes in tests and production code.
- Replacing references and include paths for the text processor with those for text utilities and the new PyJinjaTemplateProcessor.
- Introducing debug output (via std::cout) in the ContinuousBatchingServableInitializer code.
Reviewed Changes
Copilot reviewed 22 out of 24 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| src/test/llm/llmnode_test.cpp | Wrapped Python initialization and updated include for text processing utilities. |
| src/mediapipe_internal/mediapipegraphexecutor.hpp | Moved LLM session side packet setting outside the Python guard. |
| src/mediapipe_internal/mediapipegraphdefinition.cpp | Adjusted Python resource guard closing for node initialization. |
| src/llm/visual_language_model/legacy/servable_initializer.cpp | Replaced loadTextProcessor with loadTemplateProcessor behind a PYTHON_DISABLE guard. |
| src/llm/text_utils.hpp | Updated copyright year and removed Python embedding includes. |
| src/llm/servable_initializer.cpp | Renamed loadTextProcessor to loadTemplateProcessor and added a new parsing function for max model length. |
| src/llm/language_model/continuous_batching/servable_initializer.cpp | Added extensive debug logging using std::cout. |
| Other files | Updated include references and conditional Python blocks to support C++-only text generation. |
Files not reviewed (2)
- src/BUILD: Language not supported
- src/llm/BUILD: Language not supported
```cpp
std::cout << "Parsed models path: " << parsedModelsPath << std::endl;
if (!status.ok()) {
    return status;
}
auto properties = std::static_pointer_cast<ContinuousBatchingServableProperties>(servable->getProperties());
std::cout << "Properties casted successfully." << std::endl;

properties->modelsPath = parsedModelsPath;
std::cout << "Models path set to: " << properties->modelsPath << std::endl;

properties->schedulerConfig.max_num_batched_tokens = nodeOptions.max_num_batched_tokens();
std::cout << "Scheduler max_num_batched_tokens set to: " << properties->schedulerConfig.max_num_batched_tokens << std::endl;

properties->schedulerConfig.cache_size = nodeOptions.cache_size();
std::cout << "Scheduler cache_size set to: " << properties->schedulerConfig.cache_size << std::endl;

properties->schedulerConfig.dynamic_split_fuse = nodeOptions.dynamic_split_fuse();
std::cout << "Scheduler dynamic_split_fuse set to: " << properties->schedulerConfig.dynamic_split_fuse << std::endl;

properties->schedulerConfig.max_num_seqs = nodeOptions.max_num_seqs();
std::cout << "Scheduler max_num_seqs set to: " << properties->schedulerConfig.max_num_seqs << std::endl;
```
Debug output using std::cout appears in production code; consider replacing these statements with the project's logging framework (e.g., spdlog) for consistency and better control over log levels.
Suggested change (replace the `std::cout` statements with spdlog calls):

```cpp
// Before
std::cout << "Parsed models path: " << parsedModelsPath << std::endl;
if (!status.ok()) {
    return status;
}
auto properties = std::static_pointer_cast<ContinuousBatchingServableProperties>(servable->getProperties());
std::cout << "Properties casted successfully." << std::endl;
properties->modelsPath = parsedModelsPath;
std::cout << "Models path set to: " << properties->modelsPath << std::endl;
properties->schedulerConfig.max_num_batched_tokens = nodeOptions.max_num_batched_tokens();
std::cout << "Scheduler max_num_batched_tokens set to: " << properties->schedulerConfig.max_num_batched_tokens << std::endl;
properties->schedulerConfig.cache_size = nodeOptions.cache_size();
std::cout << "Scheduler cache_size set to: " << properties->schedulerConfig.cache_size << std::endl;
properties->schedulerConfig.dynamic_split_fuse = nodeOptions.dynamic_split_fuse();
std::cout << "Scheduler dynamic_split_fuse set to: " << properties->schedulerConfig.dynamic_split_fuse << std::endl;
properties->schedulerConfig.max_num_seqs = nodeOptions.max_num_seqs();
std::cout << "Scheduler max_num_seqs set to: " << properties->schedulerConfig.max_num_seqs << std::endl;
```

```cpp
// After
spdlog::info("Parsed models path: {}", parsedModelsPath);
if (!status.ok()) {
    return status;
}
auto properties = std::static_pointer_cast<ContinuousBatchingServableProperties>(servable->getProperties());
spdlog::info("Properties casted successfully.");
properties->modelsPath = parsedModelsPath;
spdlog::info("Models path set to: {}", properties->modelsPath);
properties->schedulerConfig.max_num_batched_tokens = nodeOptions.max_num_batched_tokens();
spdlog::info("Scheduler max_num_batched_tokens set to: {}", properties->schedulerConfig.max_num_batched_tokens);
properties->schedulerConfig.cache_size = nodeOptions.cache_size();
spdlog::info("Scheduler cache_size set to: {}", properties->schedulerConfig.cache_size);
properties->schedulerConfig.dynamic_split_fuse = nodeOptions.dynamic_split_fuse();
spdlog::info("Scheduler dynamic_split_fuse set to: {}", properties->schedulerConfig.dynamic_split_fuse);
properties->schedulerConfig.max_num_seqs = nodeOptions.max_num_seqs();
spdlog::info("Scheduler max_num_seqs set to: {}", properties->schedulerConfig.max_num_seqs);
```
🛠 Summary
Enabling text generation endpoints in OVMS with Python disabled.
🧪 Checklist