
Bump PyTorch to 2.7.0 #3455


Draft · wants to merge 8 commits into base: develop
Conversation

@AlexanderDokuchaev (Collaborator) commented Apr 23, 2025

Changes

Update PyTorch to 2.7.0
Regenerate reference data for test_generate_text_data_functional; the output of hf-internal-testing/tiny-random-gpt2 has changed

Tests

manual/job/post_training_quantization/661/ - FX fails on saving compressed models
nightly/job/TriggerBetta/1029/ -
examples:

  • llm_compression_qat_with_lora - assert 0.034 == 0.027 ± 2.0e-03
  • llm_compression_synthetic - AssertionError: metric word_count: 81 != 83

wc - pass
Test_Install - pass

@ljaljushkin (Contributor) commented:

It appears that F.linear() is the source of the non-determinism.

When the model is on CUDA, the code below

model = AutoModelForCausalLM.from_pretrained(BASE_TEST_MODEL_ID, device_map="cuda")
torch.use_deterministic_algorithms(True)

leads to an error:

../../env/nncf-py/lib/python3.10/site-packages/torch/nn/modules/linear.py:125: in forward
return F.linear(input, self.weight, self.bias)

RuntimeError: Deterministic behavior was enabled with either torch.use_deterministic_algorithms(True) or at::Context::setDeterministicAlgorithms(true), but this operation is not deterministic because it uses CuBLAS and you have CUDA >= 10.2. To enable deterministic behavior in this case, you must set an environment variable before running your PyTorch application: CUBLAS_WORKSPACE_CONFIG=:4096:8 or CUBLAS_WORKSPACE_CONFIG=:16:8. For more information, go to https://docs.nvidia.com/cuda/cublas/index.html#results-reproducibility

Then, with export CUBLAS_WORKSPACE_CONFIG=:4096:8, the results are the same on torch 2.6.0 and 2.7.0.
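
For reference, a minimal sketch of the deterministic setup (assumptions: the environment variable has to be set before the first cuBLAS call, and hf-internal-testing/tiny-random-gpt2 is used here as a stand-in for BASE_TEST_MODEL_ID):

import os

# CUBLAS_WORKSPACE_CONFIG must be set before the first cuBLAS call,
# so set it before the model is created (or export it in the shell).
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"

import torch
from transformers import AutoModelForCausalLM

# Make cuBLAS-backed ops such as F.linear() reproducible across runs.
torch.use_deterministic_algorithms(True)

# Stand-in for BASE_TEST_MODEL_ID from the test above.
model = AutoModelForCausalLM.from_pretrained(
    "hf-internal-testing/tiny-random-gpt2", device_map="cuda"
)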

The same thing may happen with the qat-lora sample on CPU.
When I switched to GPU, the wwb references were different:
[screenshot: differing wwb reference metrics]
use_deterministic_algorithms + CUBLAS_WORKSPACE_CONFIG aligns them.

I propose updating the references, since running with use_deterministic_algorithms would make the test slower, and the results were the same.
