
[VLM] Florence-2 supports online serving #16164


Merged
merged 3 commits into vllm-project:main from Isotr0py:florence-2-online
Apr 7, 2025

Conversation

@Isotr0py (Collaborator) commented Apr 7, 2025

Example command to launch the server:

vllm serve microsoft/Florence-2-large --tokenizer facebook/bart-large --trust-remote-code --chat-template examples/template_florence2.jinja

Inference:

from openai import OpenAI

# Client setup is assumed here (the usual vLLM OpenAI-compatible defaults);
# adjust base_url and image_url to your deployment.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
model = "microsoft/Florence-2-large"
image_url = "https://example.com/image.jpg"  # any reachable image URL

chat_response = client.chat.completions.create(
    model=model,
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "image_url",
                    "image_url": {
                        "url": image_url,
                    },
                },
                {"type": "text", "text": "<DETAILED_CAPTION>"},
            ],
        }
    ],
)
print(chat_response.choices[0].message.content)

FIX #15968

Isotr0py added 2 commits April 6, 2025 17:28

github-actions bot commented Apr 7, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which executes a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of it by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run full CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@Isotr0py Isotr0py requested a review from DarkLight1337 April 7, 2025 07:18
@mergify mergify bot added the documentation (Improvements or additions to documentation) and frontend labels Apr 7, 2025
@DarkLight1337 (Member) left a comment

I can successfully run the model using the chat template and task token (which I have added to the PR description), thanks for working on this!

@DarkLight1337 (Member)

However it seems that test_common.py is now failing...

@DarkLight1337 (Member)

I suggest updating the chat template to handle the task token externally.

@Isotr0py (Collaborator, Author) commented Apr 7, 2025

However it seems that test_common.py is now failing...

Oh, it's just because tokenizer.encode was missing add_special_tokens=False. The common tests should pass now.
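
(For reference, a minimal sketch of the difference, assuming the facebook/bart-large tokenizer used above; this is illustrative, not the PR's actual code:)

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/bart-large")

# By default, encode() wraps the input with BART's <s> ... </s> special tokens.
print(tokenizer.encode("<DETAILED_CAPTION>"))
# With add_special_tokens=False, only the ids of the text itself are returned.
print(tokenizer.encode("<DETAILED_CAPTION>", add_special_tokens=False))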

@DarkLight1337 (Member)

Can confirm, this should be good to go then

@DarkLight1337 DarkLight1337 added the ready (ONLY add when PR is ready to merge/full CI is needed) label Apr 7, 2025
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) April 7, 2025 08:41
@vllm-bot vllm-bot merged commit 7c80368 into vllm-project:main Apr 7, 2025
44 of 49 checks passed
@Isotr0py Isotr0py deleted the florence-2-online branch April 7, 2025 13:37
lengrongfu pushed a commit to lengrongfu/vllm that referenced this pull request Apr 7, 2025
nishith-fujitsu pushed a commit to nishith-fujitsu/vllm that referenced this pull request Apr 9, 2025
@PedroMiolaSilva

@Isotr0py nice, thanks a lot!

Just a quick question: I've been trying to use the Florence tasks that are supposed to return object positions (OD, OCR_WITH_REGION, etc.), and I'm getting an empty response, or just the object names with no box positions. Description tasks work fine.

This is how I'm using it:

docker run \
    --runtime nvidia \
    -e VLLM_USE_V1=0 \
    --gpus 0 \
    --ipc=host \
    -p "8000:8000" \
    --env "HUGGING_FACE_HUB_TOKEN=${HUGGING_FACE_HUB_TOKEN}" \
    -v "${HF_HOME}:/root/.cache/huggingface" \
    -v "$(pwd):/app" \
    vllm/vllm-openai:latest \
    --tensor-parallel-size 1 \
    --model microsoft/Florence-2-base \
    --tokenizer facebook/bart-large \
    --gpu-memory-utilization 0.95 \
    --trust-remote-code \
    --chat-template /app/template_florence2.jinja \
    --max-model-len 1024 \
    --max-num-seqs 8 \
    --dtype float16

With the following cURL to test it:

curl -X POST http://localhost:8000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "microsoft/Florence-2-base",
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": "https://media.istockphoto.com/id/1152862811/pt/foto/bavarian-dance.jpg?s=1024x1024&w=is&k=20&c=ie6_xDrdmnkDd4Udqn8n2kP_xeRpjkGdTduvO3J4KT4="
                        }
                    },
                    { "type": "text", "text": "<OCR_WITH_REGION>" }
                ]
            }
        ]
    }'

And the response:

{
  "id": "chatcmpl-a9740a45bc1441dab29ad22fe4c1a791",
  "object": "chat.completion",
  "created": 1744725371,
  "model": "microsoft/Florence-2-base",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "reasoning_content": null,
        "content": "iStockCredit: Foo TooEditorial use only1152862811",
        "tool_calls": []
      },
      "logprobs": null,
      "finish_reason": "stop",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 590,
    "total_tokens": 637,
    "completion_tokens": 47,
    "prompt_tokens_details": null
  },
  "prompt_logprobs": null
}

Is there anything I'm missing or doing wrong?

@Isotr0py (Collaborator, Author)

@PedroMiolaSilva Can you try using this tokenizer (Isotr0py/Florence-2-tokenizer) and adding skip_special_tokens=False to extra_body as a sampling parameter?
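
(For reference, a minimal sketch of that suggestion with the OpenAI Python client; the base_url, API key, and image URL below are placeholders, not values from this thread:)

from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

chat_response = client.chat.completions.create(
    model="microsoft/Florence-2-base",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}},
                {"type": "text", "text": "<OCR_WITH_REGION>"},
            ],
        }
    ],
    # vLLM accepts extra sampling parameters through extra_body.
    extra_body={"skip_special_tokens": False},
)
print(chat_response.choices[0].message.content)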

@PedroMiolaSilva

@Isotr0py it worked, thanks a lot!

For this model, shouldn't the default value for skip_special_tokens be set to True?

@Isotr0py (Collaborator, Author)

For this model, shouldn't the default value for skip_special_tokens be set to True?

No, because Florence-2 uses special tokens like <loc_{x}> (x = 1, 2, ..., 999) to represent locations. If we set skip_special_tokens=True, these special tokens are skipped during tokenizer decoding, so you would get just the object names without their locations.
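
(A small, hypothetical sketch of recovering those coordinates from the decoded text; the sample output string below is made up:)

import re

# Florence-2 emits a label followed by four <loc_*> tokens per box,
# with coordinates quantized onto a grid scaled to the image size.
text = "person<loc_52><loc_333><loc_932><loc_774>"

coords = [int(v) for v in re.findall(r"<loc_(\d+)>", text)]
print(coords)  # [52, 333, 932, 774]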

@PedroMiolaSilva

I'm sorry, I meant set to False, because by default it is set to True, right?

@Isotr0py (Collaborator, Author)

because by default it is set to True, right?

Yes. It's set to True by default in SamplingParams.
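
(A quick way to verify this, assuming vllm is installed locally:)

from vllm import SamplingParams

# skip_special_tokens defaults to True, hence the extra_body override above.
print(SamplingParams().skip_special_tokens)  # True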

yangw-dev pushed a commit to yangw-dev/vllm that referenced this pull request Apr 21, 2025
Labels: documentation, frontend, ready

Successfully merging this pull request may close these issues:
[Bug]: Crashing server running Florence-2 when trying to call as multi modal