FAQ
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
Explore
139,700 skills indexed with the new KISS metadata standard.
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
Flash attention は、トレーニングと推論を加速するオプションです。H100、A100、RTX 3090、T4、RTX 2080 などの Turing、Ampere、Ada、および Hopper アーキテクチャの NVIDIA GPU だけが、flash attention をサポートできます。それをインストールせずに私たちのモデルを使用することができます。
*.so
*.so
/test
*.swp
Read this in English.
In order to make the contribution process as smooth as possible, we have established some
中文README.
generic skill
pycache/
We are happy to accept your contributions to make this repo better and more awesome! To avoid unnecessary work on either
English | 中文
English | 中文
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
generic skill
*/.DS_Store
Version 2.0, January 2004
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️竞技场/Ar