This directory contains a NumPy implementation of Transformer components with both forward and backward passes. The implementations are validated against PyTorch to ensure gradient correctness. Add ...