| Jan Małaśnicki |
Conference papers
| 1. | Ludziejewski J.♦, Pióro M., Krajewski J.♦, Stefaniak M.♦, Krutul M.♦, Małaśnicki J.♦, Cygan M.♦, Sankowski P.♦, Adamczewski K.♦, Miłoś P.♦, Jaszczur S.♦, Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient, PMLR, 42nd International Conference on Machine Learning, 2025-07-13/07-19, Vancouver (CA), pp.1-18, 2025 |












