Sharing my thoughts, discussing my projects, and traveling the world.
Contact: @borz
Last updated 2 weeks, 6 days ago
Telegram stands for freedom and privacy and has many easy to use features.
Last updated 3 weeks ago
Official Graph Messenger (Telegraph) Channel
Download from Google Play Store:
https://play.google.com/store/apps/details?id=ir.ilmili.telegraph
Donation:
https://graphmessenger.com/donate
Last updated 5 months, 1 week ago
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
The paper introduces Tokenformer. The architecture leverages the attention mechanism to facilitate not only inter-token computations but also interactions between tokens and model parameters. The authors replace all linear projection layers in the Transformer with Pattention layers, allowing for efficient incremental scaling without the need for retraining from scratch.
Future work:
- Extending the Mixture-of-Experts Paradigm
- Advancing Parameter-Efficient Tuning
- Integrating Vision and Language Models
- Device-Cloud Collaboration
- Enhancing Model Interpretability
While some of the promises of AI have come true, and technology (like ChatGPT and its plugins) will continue to impress with its capabilities, AI-based technologies have largely failed to live up to the mountainous hype. In 2025, the authors expect the industry to pull back on the promises, investment, and hype of new AI capabilities and settle down into what is real versus marketing noise.
Sharing my thoughts, discussing my projects, and traveling the world.
Contact: @borz
Last updated 2 weeks, 6 days ago
Telegram stands for freedom and privacy and has many easy to use features.
Last updated 3 weeks ago
Official Graph Messenger (Telegraph) Channel
Download from Google Play Store:
https://play.google.com/store/apps/details?id=ir.ilmili.telegraph
Donation:
https://graphmessenger.com/donate
Last updated 5 months, 1 week ago