Upgrade 2024: Let's Upgrade Reality

April 11, 2024 // Upgrade 2024

tsuzumi

Kyosuke Nishida

Summary

tsuzumi is a large language model (LLM) created by NTT Laboratories. The name is inspired by the traditional Japanese drum “鼓” (tsuzumi), and the model reflects the instrument’s compact and efficient design. Our vision for the future involves tackling societal challenges through the collaborative intelligence of a network of smaller, specialized LLMs like tsuzumi.

In this presentation, Kyosuke Nishida, Senior Distinguished Researcher in the NTT Human Informatics Laboratories, demonstrates the tsuzumi-7B model, which was developed from scratch, has 7 billion parameters, and was trained on over one trillion Japanese and English tokens. He also showcases a vision-and-language model that uses tsuzumi for visual document understanding.
