AI research

Study says OpenAI’s business model is sound

Summary The progress of open-source language models is undisputed. But can they really compete with the much pricier, heavily trained language models from OpenAI, Google, and others? Sounds too good to be true: With little training effort and almost no money, open-source language models trained using the Alpaca Formula have set new benchmarks recently, reaching …

Study says OpenAI’s business model is sound Read More »

Guanaco is a ChatGPT competitor trained on a single GPU in one day

Summary A new method named QLoRA enables the fine-tuning of large language models on a single GPU. Researchers used it to train Guanaco, a chatbot that reaches 99% of ChatGPTs performance. Researchers at the University of Washington present QLoRA (Quantized Low Rank Adapters), a method for fine-tuning large language models. Along with QLoRA, the team …

Guanaco is a ChatGPT competitor trained on a single GPU in one day Read More »

Meta’s new open source models speak more than 1,100 languages

Summary As part of the Massively Multilingual Speech project, Meta is releasing AI models that can convert spoken language to text and text to speech in 1,100 languages. The new set of models is based on Meta’s wav2vec, as well as a curated dataset of examples for 1,100 languages ​​and another uncurated dataset for nearly …

Meta’s new open source models speak more than 1,100 languages Read More »

“System 2”-inspired method enhances GPT-4’s logic capability

Summary The “Tree of Thoughts” framework combines tree search with GPT-4 to dramatically improve the problem-solving capabilities of the language model. “Tree of Thoughts” is a new framework from researchers at Princeton University and Google DeepMind for inferencing language models like GPT-4, inspired by prompt engineering methods like Chain of Thought. Unlike those, however, ToT …

“System 2”-inspired method enhances GPT-4’s logic capability Read More »

Chatbot Arena helps you find the best open-source chatbot

Summary Until now, there has been no easy way to compare the quality of open-source models. An e-sports-inspired system could help. The Large Model System Organization (LMSYS), which is behind the open-source model Vicuna, has launched the benchmark platform “Chatbot Arena” to compare the performance of large language models. Different models compete against each other …

Chatbot Arena helps you find the best open-source chatbot Read More »

Google researchers make voice a solid smartphone interface

Summary Until now, AI has had a hard time controlling smartphone interfaces. But Google researchers seem to have found a way. To improve voice-based interaction with mobile user interfaces, researchers at Google Research have been investigating the use of large language models (LLM). Current mobile intelligent assistants are limited in conversational interactions because they cannot …

Google researchers make voice a solid smartphone interface Read More »

HumanRF enables photorealistic 3D avatars

Summary HumanRF brings high-resolution 3D avatars to NeRFs. Behind it is an AI startup for synthetic media. Neural Radiance Fields (NeRFs) learn 3D representations from photos or videos and can render individual objects or entire scenes. Some variants specialize in moving scenes or objects, others experiment with editing capabilities, and others attempt to render people …

HumanRF enables photorealistic 3D avatars Read More »

Scroll to Top