AI research

Language models defy ‘Stochastic Parrot’ narrative, display semantic learning

Summary Can language models like GPT-4 learn meaning, or are they stochastic parrots? A new research paper shows that the models learn more than some critics give them credit for. In a new study, researchers at CSAIL at the Massachusetts Institute of Technology (MIT) show that language models can learn meaning even if they have …

Language models defy ‘Stochastic Parrot’ narrative, display semantic learning Read More »

With “InternLM”, China enters the race for large language models

Summary InternLM is a large language model with 104 billion parameters introduced by China’s national AI lab, Shanghai AI Lab, together with the surveillance company SenseTime. The Chinese University of Hong Kong, Fudan University and Shanghai Jiaotong University were also involved in its development. On Chinese-language tasks, it clearly outperforms OpenAI’s ChatGPT and Anthropics Claude. …

With “InternLM”, China enters the race for large language models Read More »

AlphaDev could become AlphaFold for coding

Summary Google Deepmind’s AlphaDev is designed to find better computer algorithms. In a test run, the AI ​​system found sorting algorithms that were up to 70% more efficient. Google Deepmind has developed several influential AI models, including AlphaZero and MuZero. These algorithms are used by Google to better manage data centers and compress video. Perhaps …

AlphaDev could become AlphaFold for coding Read More »

New method makes augmented language models more efficient

Summary Language models with access to tools, called augmented language models, have potentially many more capabilities than native language models. The ReWoo method could make them much more efficient. Currently, the most prominent example of an augmented language model is ChatGPT with Internet browsers or plugins. Thanks to these tools, ChatGPT can, for example, retrieve …

New method makes augmented language models more efficient Read More »

New method generates AI images on iPhone in less than 2 seconds

Summary Snapchat’s researchers have developed a new method for AI images on smartphones. This should allow users to eliminate the hardware that would otherwise be required and enjoy greater privacy. Recent versions of image AI, such as Midjourney 5.1, Stable Diffusion XL, and Adobe Firefly, have raised the quality of generated graphics to a new …

New method generates AI images on iPhone in less than 2 seconds Read More »

OpenAI improves GPT-4’s mathematical reasoning with a new form of supervision

Summary OpenAI shows an AI model that achieves SOTA in solving some mathematical problems. The underlying process could lead to better language models in general. In the Let’s Verify Step by Step paper, the OpenAI team trained several models based on GPT-4 to solve problems in the MATH dataset. The goal was to compare two …

OpenAI improves GPT-4’s mathematical reasoning with a new form of supervision Read More »

Scroll to Top