Google Bard Global Update: Supports 40 Languages and Adds Image Generation Feature


Google Gemini Pro/Ultra subscribers can now experience the Veo 3.1 video model, featuring the new 'Ingredients to Video' function: supports uploading three reference images at once, extracting character, scene, and style features respectively, and generating an 8-second 1080p video. The generated content includes an embedded SynthID invisible watermark, supporting text input on web and mobile devices for one-click generation. The system ensures character consistency across frames and consistent lighting, with demonstration cases showing that three self-portraits + cyber city background + oil painting style image can
The Nano Banana2 AI image model has achieved a major breakthrough, overcoming the challenges of complex detail reproduction. By simulating the human multi-stage creative process, it enables image generation to move from random output to controllable refinement, thoroughly solving issues with details such as text, time, and lighting, leading the industry into a new phase of precise generation.
Recently, the Qwen VLo multimodal large model was officially released, achieving significant advancements in image content understanding and generation, offering users a brand-new visual creation experience. According to the introduction, Qwen VLo has been comprehensively upgraded based on the advantages of the original Qwen-VL series models. The model not only can accurately understand the "world", but also can perform high-quality re-creation based on understanding, truly achieving the transition from perception to generation. Users can now access Qwen Chat (chat.qwen.ai)
At the intersection of science and technology, graphs are increasingly attracting the attention of researchers as important tools for expressing complex relationships. From chemical molecule design to social network analysis, graphs play an indispensable role in numerous fields. However, generating graphics efficiently and flexibly has always been a challenging problem. Recently, research teams from Tufts University, Northeastern University, and Cornell University have collaborated to launch a project called Graph Generative Pre-trained Tran.
At the Volcano Engine FORCE Conference on December 18, 2024, Jimed AI officially launched its new poster generation feature. This technology marks another significant advancement in the field of image generation. According to Jimed AI's product manager Li Chao, the standout feature of this poster generation function is its user-friendliness. Users only need to input a simple description, and the system can quickly generate creative posters. This process significantly reduces the time and expertise traditionally required in poster design, allowing more people to participate easily.