Llama.cpp Now Supports Qwen2-VL (Vision Language Model)

I'm testing it right now to get it running.

If I git clone the Qwen/Qwen2-VL-2B-Instruct repo into /whatever/Qwen/Qwen2-VL-2B-Instruct/ and make a GGUF out of it with convert_hf_to_gguf.py, everything is fine and I get a Qwen-Qwen2-VL-2B-Instruct-F16.gguf.
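
For anyone following along, the text-model conversion is a single command; something like the following should work (the --outtype f16 flag is convert_hf_to_gguf.py's way of selecting F16 output, and the exact output filename is derived from the model metadata):

python convert_hf_to_gguf.py /whatever/Qwen/Qwen2-VL-2B-Instruct/ --outtype f16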

But when I try to convert the vision encoder to GGUF format with qwen2_vl_surgery.py:
python examples/llava/qwen2_vl_surgery.py "/whatever/Qwen/Qwen2-VL-2B-Instruct/"

it fails and Python throws an error. Here is the full console output:

(venv) ali0une@Debian:~/compil/llama.cpp$ python examples/llava/qwen2_vl_surgery.py "/whatever/Qwen/Qwen2-VL-2B-Instruct"
model_name:  /whatever/Qwen/Qwen2-VL-2B-Instruct
`Qwen2VLRotaryEmbedding` can now be fully parameterized by passing the model config through the `config` argument. All other arguments will be removed in v4.46
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00,  2.17it/s]
Qwen2VLVisionConfig {
  "depth": 32,
  "embed_dim": 1280,
  "hidden_act": "quick_gelu",
  "hidden_size": 1536,
  "in_channels": 3,
  "in_chans": 3,
  "mlp_ratio": 4,
  "model_type": "qwen2_vl",
  "num_heads": 16,
  "patch_size": 14,
  "spatial_merge_size": 2,
  "spatial_patch_size": 14,
  "temporal_patch_size": 2,
  "transformers_version": "4.47.0"
}

[to_gguf_name] vision_model.blocks.0.norm1.weight --> v.blk.0.ln1.weight

...

[to_gguf_name] merger.mlp.2.bias --> mm.2.bias
Traceback (most recent call last):

  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/utils/_http.py", line 406, in hf_raise_for_status
    response.raise_for_status()
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/requests/models.py", line 1024, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 401 Client Error: Unauthorized for url: https://huggingface.co/Qwen2-VL-2B-Instruct/resolve/main/processor_config.json

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/transformers/utils/hub.py", line 403, in cached_file
    resolved_file=hf_hub_download(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
    return fn(*args, **kwargs)
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 862, in hf_hub_download
    return _hf_hub_download_to_cache_dir(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 969, in _hf_hub_download_to_cache_dir
    _raise_on_head_call_error(head_call_error, force_download, local_files_only)
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1484, in _raise_on_head_call_error
    raise head_call_error
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1376, in _get_metadata_or_catch_error
    metadata=get_hf_file_metadata(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn
    return fn(*args, **kwargs)
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 1296, in get_hf_file_metadata
    r=_request_wrapper(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 277, in _request_wrapper
    response=_request_wrapper(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/file_download.py", line 301, in _request_wrapper
    hf_raise_for_status(response)
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/huggingface_hub/utils/_http.py", line 454, in hf_raise_for_status
    raise _format(RepositoryNotFoundError, message, response) from e
huggingface_hub.errors.RepositoryNotFoundError: 401 Client Error. (Request ID: Root=1-675de5d3-319c681e02ab26174d878b71;82815812-2512-468e-bba8-8beac819dd0c)

Repository Not Found for url: https://huggingface.co/Qwen2-VL-2B-Instruct/resolve/main/processor_config.json.
Please make sure you specified the correct `repo_id` and `repo_type`.
If you are trying to access a private or gated repo, make sure you are authenticated.
Invalid username or password.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/whatever/llama.cpp/examples/llava/qwen2_vl_surgery.py", line 158, in 
    main(args)
  File "/whatever/llama.cpp/examples/llava/qwen2_vl_surgery.py", line 142, in main
    processor: Qwen2VLProcessor=AutoProcessor.from_pretrained(model_name)
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/transformers/models/auto/processing_auto.py", line 254, in from_pretrained
    processor_config_file=get_file_from_repo(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/transformers/utils/hub.py", line 557, in get_file_from_repo
    return cached_file(
  File "/whatever/llama.cpp/venv/lib/python3.10/site-packages/transformers/utils/hub.py", line 426, in cached_file
    raise EnvironmentError(
OSError: Qwen2-VL-2B-Instruct is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
If this is a private repository, make sure to pass a token having permission to this repo either by logging in with `huggingface-cli login` or by passing `token=<your_token>`
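
My reading of the traceback (not a verified walkthrough of the script) is that the failing URL gives the cause away: https://huggingface.co/Qwen2-VL-2B-Instruct/resolve/main/processor_config.json has lost both the local /whatever/ prefix and the Qwen/ namespace. It looks like the surgery script reduces the local directory path to its basename before calling AutoProcessor.from_pretrained, and transformers then treats that bare basename as a Hub repo id it cannot find. Below is a minimal sketch of the suspected pattern plus a local workaround; the variable names are illustrative, not the script's own.

import os
from transformers import AutoProcessor

model_path = "/whatever/Qwen/Qwen2-VL-2B-Instruct"  # local HF snapshot

# Suspected pattern: the local path is shortened to its basename (handy for
# building the output filename), and the shortened name is later reused for
# AutoProcessor.from_pretrained(), which then looks for it on the Hub.
model_name = model_path
if os.path.isdir(model_name):
    model_name = os.path.basename(model_name.rstrip(os.sep))  # "Qwen2-VL-2B-Instruct"
fname_out = f"{model_name.replace('/', '-').lower()}-vision.gguf"

# Workaround sketch: load the processor from the original path so nothing is
# fetched from the Hub; keep the shortened name only for the output filename.
processor = AutoProcessor.from_pretrained(model_path)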

But it just works when I don't specify the path to the downloaded HF model repo:
(venv) ali0une@Debian:~/compil/llama.cpp$ python examples/llava/qwen2_vl_surgery.py

and I get /whatever/llama.cpp/qwen-qwen2-vl-2b-instruct-vision.gguf.
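
That lines up with the reading above: the output name qwen-qwen2-vl-2b-instruct-vision.gguf looks like the lowercased repo id Qwen/Qwen2-VL-2B-Instruct, so when no argument is given the script presumably falls back to that Hub id, downloads the processor config from Hugging Face, and the local-path problem never comes up.

With both files in hand, the vision GGUF is meant to act as the multimodal projector next to the text-model GGUF. Assuming the llama-qwen2vl-cli example binary has been built (binary name and flags follow the existing llava CLI convention; check your own build), usage should look roughly like:

./llama-qwen2vl-cli -m Qwen-Qwen2-VL-2B-Instruct-F16.gguf --mmproj qwen-qwen2-vl-2b-instruct-vision.gguf --image /path/to/image.jpg -p "Describe this image."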
