ELYZA Releases Japanese LLM Based on Llama 2, with 7 Billion Parameters, Competing with GPT-3.5


Google has launched a new series of open-source models called 'Gemma'. This piece compares Gemma with Llama 2 and Mistral, outlines Gemma's design principles and features, and describes the characteristics of the GeGLU activation function used in Gemma.
Meta has introduced the next-generation large language model Llama 2, which surpasses GPT-3.5 Turbo and Claude 2 on certain tasks. Released in July 2023, Llama 2 comes in three model sizes: 7 billion, 13 billion, and 70 billion parameters. Meta is collaborating with Dell to help enterprise customers deploy Llama 2 models locally instead of relying solely on cloud services. The Llama 2 model is open-source and available for research and some commercial use. Meta has also partnered with Microsoft and Amazon.
Microsoft recently announced a series of upgrades to its AI suite, including support for the Llama 2 large model and the launch of a GPT-4 version of Office. Llama 2 is a new AI model for natural language processing. The GPT-4 version of Office is a premium office suite that Microsoft offers on a $30/month subscription. The upgrades broaden the range of AI technology available in Microsoft's products.
AI agent automation startup CrewAI recently announced that it has raised $18 million in funding. The total includes a seed round led by Boldstart Ventures and a Series A round led by Insight Partners. Blitzscaling Ventures, Craft Ventures, Earl Grey Capital, and other prominent investors also participated, along with well-known AI experts.
Google DeepMind, in collaboration with the Massachusetts Institute of Technology (MIT), recently announced a significant research result. The research team developed a new autoregressive model called 'Fluid' for text-to-image generation. Scaled up to 10.5 billion parameters, the model has demonstrated outstanding performance. This research challenges a common perception in the industry: while autoregressive models have dominated the language processing domain, they have traditionally been seen as inferior to Stable Diffusion and Goog.