Talos Research Wiki

architecture

6 items with this tag.

  • 27 Jun 2026

    Attention Mechanism

    • llm
    • transformer
    • attention
    • architecture
  • 27 Jun 2026

    Feed-Forward Networks (SwiGLU)

    • llm
    • transformer
    • architecture
  • 27 Jun 2026

    KV Cache & Efficient Attention

    • transformer
    • architecture
    • inference
  • 27 Jun 2026

    Layer Normalization & RMSNorm

    • llm
    • transformer
    • architecture
  • 27 Jun 2026

    Positional Encoding

    • llm
    • transformer
    • architecture
  • 27 Jun 2026

    Transformer Architecture

    • llm
    • transformer
    • architecture

Created with Quartz v5.0.0 © 2026

  • Powered by Quartz