DeepSeek Unveils Janus Pro AI: Superior Image Generation

By Aayush

DeepSeek has made waves in the AI market by introducing a low-cost ChatGPT competitor with similar capabilities. Now, the company is venturing into image generation with its new open-source model, Janus Pro. According to tests shared by the startup, the results are promising and competitive with existing industry leaders.

The Janus Pro 7B model, launched on Monday (27), features multimodal functionality, allowing it to interpret multiple media formats. DeepSeek also highlighted improvements in image generation capabilities. Benchmark tests released by the company indicate that Janus Pro 7B outperforms OpenAI’s DALL-E 3 and older versions of Stable Diffusion models in key performance metrics.

Advertisements

It’s important to note that these benchmark tests don’t necessarily evaluate the final quality of the generated images. Instead, they focus on the model’s overall performance and its ability to understand and execute user commands effectively.

How Janus Pro works

The Janus Pro 7B model is currently in its demonstration phase but can already be tested on the Hugging Face platform. The functionality is straightforward: users input a text prompt describing their desired image and receive four generated samples.

Advertisements

That said, the tool on Hugging Face does have some limitations. Images are relatively small, and there are noticeable delays in generating responses. However, these issues are likely tied to the demonstration setup and not the model’s final version.

In terms of results, Janus Pro still lags behind leading market competitors and feels more akin to the early versions of similar AI models. For instance, in a test by Canaltech using the prompt “a dog on the streets of Rio de Janeiro,” Janus Pro generated a generic background with only a silhouette of the dog. In contrast, DALL-E produced a far more detailed and colorful image, showing its edge in image quality and refinement.

Advertisements

All Janus Pro files and documentation are available in the DeepSek Github Repository (github.com/deepseek-ai/janus).

What is Deepseek?

DeepSeek is an AI chatbot developed by a Chinese startup of the same name. Launched in January this year, it features the DeepSeek R1 model, which quickly gained attention for delivering results comparable to ChatGPT but with significantly lower infrastructure costs.

Advertisements

The debut of this Chinese AI shook the tech industry, particularly impacting the stock market of major players. Nvidia, a leader in advanced AI chips, experienced a dramatic 17% drop in market value in a single day, with an estimated loss of $580 billion in share value. This underscores the disruptive potential of DeepSeek in the AI landscape.

TAGGED:
Share This Article
Follow:
Aayush is a B.Tech graduate and the talented administrator behind AllTechNerd. . A Tech Enthusiast. Who writes mostly about Technology, Blogging and Digital Marketing.Professional skilled in Search Engine Optimization (SEO), WordPress, Google Webmaster Tools, Google Analytics
Leave a Comment