A Sneak Peek into Janus-Pro: DeepSeek-AI's Multimodal Visual Understanding and Generation

janus image
Prompt Example given by Janus: Astronaut in a jungle, cold color palette, muted colors, detailed, 8k

One of the biggest news last days was probably the announcement of DeepSeek-v3. While the whole world is still amazed to learn about DeepSeek, on January 27th 2025 DeepSeek-AI released Janus-Pro. Janus-Pro is a multimodal model that performs both image understanding task and text-to-image generation. We have installed Janus-Pro on our PC and tried it out. Below we share our first impressions on Janus-Pro.

  • Installation: We followed the instruction on the official GitHub page of Janus. We tested the Janus-Pro-1B model on a local PC. 
  • First Impressions:
    • Janus-Pro does a decent job in image understanding, also in multi-lingual. We uploaded an image from a German text book, and it summaries the text in German well, with only a little issue with German umlauts. 
    • The text-to-image generation can be further improved. While the provided prompt examples generate nice images (e.g. the astronaut image above), the prompts we tried seem to generate in lower quality in comparison to Dalle-3 and Midjourney (see examples below).  
janus image
A club logo generated by JanusPro-1B
dalle image
A logo generated by Dalle

We are still very impressed to see Janus-Pro’s performance as a multimodal visual model. Below are more examples we tried. 

janus generated image
Prompt: draw a cartoon style picture of the story of Snow White.
janus generated aerial image
Prompt: generate an aerial image of an old neighbourhood in Florence.
German
Question: what is this article about? Response: this article is about "F�hlingsseel - Osterfest!" which is a celebration of Easter. It includes a poem about the Easter bunny and mentions activities like decorating eggs and making a bunny cake. There is also a section on "F�hlingsseel - Osterfest!" which is a German Easter celebration.