Poster session
Hebbian Sparse Autoencoder
ICLR NFAM Workshop
Nikita Kurdiukov, Anton Razzhigaev
Certification of speaker recognition models to additive perturbations
AAAI
Dmitrii Korzh, Elvir Karimov, Mikhail Pautov, Oleg Y Rogov, Ivan Oseledets
Clear: Character unlearning in textual and visual modalities
ACL
Alexey Dontsov, Dmitrii Korzh, Alexey Zhavoronkin, Boris Mikheev, Denis Bobkov, Aibek Alanov, Oleg Y Rogov, Ivan Oseledets, Elena Tutubalina
ACMMM
Igor Meleshin, Anna Chistyakova, Anastasia Antsiferova, Dmitriy Vatolin
ICLR
Milena Gazdieva, Jaemoo Choi, Alexander Kolesov, Jaewoong Choi, Petr Mokrov, Alexander Korotin
ACL
Kristian Kuznetsov, Laida Kushnareva, Polina Druzhinina, Anton Razzhigaev, Anastasia Voznyuk, Irina Piontkovskaya, Evgeny Burnaev, Serguei Barannikov
SIGIR
Julia Belikova, Konstantin Polev, Rauf Parchiev, Dmitry Simakov
EMNLP
Elisei Rykov, Kseniia Petrushina, Maksim Savkin, Valerii Olisov, Artem Vazhentsev, Kseniia Titova, Alexander Panchenko, Vasily Konovalov, Julia Belikova
IJCAI
Alina Kostromina, Kseniia Kuvshinova, Aleksandr Yugay, Andrey Savchenko, Dmitry Simakov
KDD
Oleg Kachan, Andrey Savchenko, Gleb Gusev
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation
ACMMM
Konstantin Egorov, Stepan Botman, Pavel Blinov, Galina Zubkova, Anton Ivaschenko, Alexander Kolsanov, Andrey Savchenko
MADD: Multi-Agent Drug Discovery Orchestra
EMNLP
Gleb Vitalevich Solovev, Alina Borisovna Zhidkovskaya, Anastasia Orlova, Nina Gubina, Anastasia Vepreva, Rodion Golovinskii, Ilya Tonkii, Ivan Dubrovsky, Ivan Gurev, Dmitry Gilemkhanov, Denis Chistiakov, Timur A. Aliev, Ivan Poddiakov, Galina Zubkova, Ekaterina V. Skorb, Vladimir Vinogradov, Alexander Boukhanovsky, Nikolay Nikitin, Andrei Dmitrenko, Anna Kalyuzhnaya, Andrey Savchenko
PyTorch-Lifestream: Learning Embeddings on Discrete Event Sequences
IJCAI
Artem Sakhno, Ivan Kireev, Dmitrii Babaev, Maxim Savchenko, Gleb Gusev, Andrey Savchenko
ATGen: A Framework for Active Text Generation
ACL
Akim Tsvigun, Daniil Vasilev, Ivan Tsvigun, Ivan Lysenko, Talgat Bektleuov, Aleksandr Medvedev, Uliana Vinogradova, Nikita Severin, Mikhail Mozikov, Andrey Savchenko, Rostislav Grigorev, Ramil Kuleev, Fedor Zhdanov, Artem Shelmanov, Ilya Makarov
HL-EAI: A Multimodal Framework Enabling Emotional Reciprocity in Human–AI Strategic Decision-Making
ACMMM
Mikhail Mozikov, Daniil Orekhov, Ivan Nasonov, Konstantin Baltsat, Vladislav Pedashenko, Dmitrii Abramov, Nikita Severin, Yury Maximov, Andrey Savchenko, Ilya Makarov
FaceCluster: Interactive Photo Organization with Enhanced Face Recognition
ACMMM
Alexander Filonenko, Ilya Makarov, Andrey V. Savchenko
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
EMNLP
Ivan Sviridov, Amina Miftahova, Artemiy Vladimirovich Tereshchenko, Galina Zubkova, Pavel Blinov, Andrey Savchenko
ICCV
Tatiana Zemskova, Dmitry Yudin
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
ICLR
Timofei Gritsaev, Nikita Morozov, Sergey Samsonov, Daniil Tiapkin
Revisiting Non-Acyclic GFlowNets in Discrete Environments
ICML
Nikita Morozov, Ian Maksimov, Daniil Tiapkin, Sergey Samsonov
Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson-Romberg Extrapolation
ICLR
Marina Sheshukova, Denis Belomestny, Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov
Refined Analysis of Constant Step Size Federated Averaging and Federated Richardson-Romberg Extrapolation
AISTATS
Paul Mangold, Alain Durmus, Aymeric Dieuleveut, Sergey Samsonov, Eric Moulines
Statistical inference for Linear Stochastic Approximation with Markovian Noise
NeurIPS
Sergey Samsonov, Marina Sheshukova, Eric Moulines, Alexey Naumov
CayleyPy Growth: Efficient growth computations and hundreds of new conjectures on Cayley graphs
NeurIPS
Alexander Chervov, Dmytro Fedoriaka, Mark Obozov, Elena V. Konstantinova, Anton Naumov, Igor Kiselev, Anastasia Sheveleva, Ivan Koltsov, Sergei Lytkin, Andrei Smolensky, Alexander Soibelman, Fedor Levkovich-Maslyuk, Ruslan Grimov, Dmitry Volovich, Artem Isakov, Anton Kostin, Michael Litvinov, Nick Vilkin-Krom, Alim Bidzhiev, Artem Krasnyi, Mikhail Evseev, Elizaveta Geraseva, Liliya Grunwald, Sergey Galkin, Eduard Koldunov, Stanislav Diner, Artem Chevychelov, Evelina Kudasheva, Arsenii Sychev, Zakhar Kogan, Altana Natyrova, Lidia Shishina, Lyudmila Cheldieva, Vladislav Zamkovoy, Dmitrii Kovalenko, Oleg Papulov, Sergey Kudashev, Dmitry Shiltsov, Rustem Turtayev, Olga Nikitina, Dariya Mamayeva, Sergei Nikolenko, Anton Titarenko, Antonina Dolgorukova, Alexey N. Aparnev, Orianne Debeaupuis, Simo Alami Chehboune, Herve Isambert
A machine learning approach that beats Rubik's cubes
NeurIPS Spotlight
Alexander Chervov, Kirill Khoruzhii, Nikita Bukhal, Jalal Naghiyev, Vladislav Zamkovoy, Ivan Koltsov, Lyudmila Cheldieva, Arsenii Sychev, Arsenii Lenin, Mark Obozov, Egor Urvanov, Alexey M. Romanov
Think, Align, Select: Query–Key Scores for LLM Reasoning
NeurIPS
Mark Obozov, Eduard Tulchinskii, Kristian Kuznetsov, Michael Diskin, Serguei Barannikov
Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean
EMNLP
Mark Obozov, Michael Diskin, Aleksandr Beznosikov, Alexander Gasnikov, Serguei Barannikov
AutoIntent: AutoML for Text Classification
EMNLP
Ilya Alekseev, Roman Solomatin, Darina Rustamova, Denis Kuznetsov
Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization
NeurIPS
Mikhail Persiianov, Arip Asadulaev, Nikita Andreev, Nikita Starodubcev, Dmitry Baranchuk, Anastasis Kratsios, Evgeny Burnaev, Alexander Korotin
Learning of Population Dynamics: Inverse Optimization Meets JKO Scheme
NeurIPS
Mikhail Persiianov, Jiawei Chen, Petr Mokrov, Alexander Tyurin, Evgeny Burnaev, Alexander Korotin
GLGENN: A Novel Parameter-Light Equivariant Neural Networks Architecture Based on Clifford Geometric Algebras
ICML
Ekaterina Filimoshina, Dmitry Shirokov
Steering LLM Reasoning Through Bias-Only Adaptation
EMNLP
Viacheslav Sinii, Alexey Gorbatovski, Artem Cherepanov, Boris Shaposhnikov, Nikita Balagansky, Daniil Gavrilov
Field Matching: An electrostatic paradigm to Generate and Transfer data
ICML
Alexander Kolesov, Stepan Manukhov, Vladimir V. Palyulin, Alexander Korotin
TabM: Advancing tabular deep learning with parameter-efficient ensembling
ICLR
Yury Gorishniy, Akim Kotelnikov, Artem Babenko
CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World
ACL
Zoya Volovikova, Gregory Gorbov, Petr Kuderov, Aleksandr Panov, Alexey Skrynnik
GraphLand: Evaluating Graph Machine Learning Models on Diverse Industrial Data
NeurIPS
Gleb Bazhenov, Oleg Platonov, Liudmila Prokhorenkova
Decentralized Optimization with Coupled Constraints
ICLR
Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev, Daniil Dorin, Alexander Gasnikov, Dmitry Kovalev
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs
EMNLP
Mikhail Seleznyov, Mikhail Chaichuk, Gleb Ershov, Alexander Panchenko, Elena Tutubalina, Oleg Somov
Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting
NeurIPS
Sergei Kholkin, Grigoriy Ksenofontov, David Li, Nikita Kornilov, Nikita Gushchin, Alexandra Suvorikova, Alexey Kroshnin, Evgeny Burnaev, Alexander Korotin
RuSemCor: a Word Sense Disambiguation corpus for Russian
CIKM
Dmitry Ilvovsky, Alik Kirillovich, Ilya Karpov, Maxim Kulaev, Natalia Loukachevitch
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA
EMNLP
Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii
RusConText Benchmark: A Russian Language Evaluation Benchmark for Understanding Context
ACL SRW
Andrey Chirkin, Svetlana Kuznetsova, Maria Volina, Anna Dengina
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models
ICML
Daniil Laptev, Nikita Balagansky, Yaroslav Aksenov, Daniil Gavrilov
An LLM-Powered Tool for Enhancing Scientific Open-Source Repositories
ICML Workshop
Nikolay Nikitin, Andrey Getmanov, Zakhar Popov, Ekaterina Alekseevna Ulyanova, Yaroslav Aksenkin, Ilya Sokolov, Alexander Boukhanovsky
Inverse Bridge Matching Distillation
ICML
Nikita Gushchin, David Li, Daniil Selikhanovych, Evgeny Burnaev, Dmitry Baranchuk, Alexander Korotin
Alchemist: Turning Public Text-to-Image Data into Generative Gold
NeurIPS
Valerii Startsev, Alexander Ustyuzhanin, Alexey Kirillov, Dmitry Baranchuk, Sergey Kastryulin
Diffusion on Language Model Encodings for Protein Sequence Generation
ICML
V. Meshchaninov, P. Strashnov, A. Shevtsov, F. Nikolaev, N. Ivanisenko, O. Kardymon, D. Vetrov
Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy
EMNLP
Nikita Balagansky, Yaroslav Aksenov, Daniil Laptev, Vadim Kurochkin, Gleb Gerasimov, Nikita Koryagin, Daniil Gavrilov
AutoJudge: Judge Decoding Without Manual Annotation
NeurIPS
Roman Garipov, Fedor Velikonivtsev, Ivan Ermakov, Ruslan Svirschevski, Vage Egiazarian, Max Ryabinin
Generalization error bound for denoising score matching under relaxed manifold assumption
COLT
Konstantin Yakovlev, Nikita Puchkin
Knowledge Graph Completion with Mixed Geometry Tensor Factorization
AISTATS
Viacheslav Yusupov, Maxim Rakhuba, Evgeny Frolov
Exploring the Hidden Capacity of LLMs for One-Step Text Generation
EMNLP
Gleb Mezentsev, Ivan Oseledets
Recurrent Action Transformer with Memory
NeurIPS
Egor Cherepanov, Aleksei Staroverov, Alexey Kovalev, Aleksandr Panov
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
NeurIPS Spotlight
Gleb Rodionov, Roman Garipov, Alina Shutova, George Yakushev, Erik Schultheis, Vage Egiazarian, Anton Sinitsin, Denis Kuznedelev, Dan Alistarh
Block-wise distillation for lightweight weather models
NeurIPS
Daniil Sukhorukov, Andrei Zakharov, Dmitry Zhevnenko, Vladimir Kirilin, Ekaterina Muravleva, Ivan Oseledets, Ilya Makarov
Frozen in the Middle: Hidden States Remain Unchanged Across Intermediate Layers of Language Models
CIKM
Pavel Tikhonov, Dmitry Ilvovsky
Mechanistic Permutability: Match Features Across Layers
ICLR
Nikita Balagansky, Ian Maksimov, Daniil Gavrilov
Evaluating robustness of tabular models under meta-features based shifts
NeurIPS Workshop
Irina Deeva, Nargiza Amerkhanova, Alena Kropacheva
Time to Split: Exploring Data Splitting Strategies for Offline Evaluation of Sequential Recommenders
RecSys
Danil Gusak, Anna Volodkevich, Anton Klenitskiy, Alexey Vasilev, Evgeny Frolov
AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment
ACL
Anastasiia Ivanova, Eva Bakaeva, Zoya Volovikova, Alexey K. Kovalev, Aleksandr I. Panov
Pisets: A Robust Speech Recognition System for Lectures and Interviews
NAACL
Ivan Bondarenko, Daniil Grebenkin, Oleg Sedukhin, Mikhail Klementev, Roman Derunets, Lyudmila Budneva
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
ICCV Spotlight
Khaled Abud, Sergey Lavrushkin, Alexey Kirillov, Dmitriy Vatolin
Meta-features informed WGAN for tabular data
ICDM
Roman Netrogolov, Irina Deeva
EBES: Easy Benchmarking for Event Sequences
KDD
Dmitry Osin, Igor Udovichenko, Viktor Moskvoretskii, Egor Shvetsov, Evgeny Burnaev
LLM-Independent Adaptive RAG: Let the Question Speak for Itself
EMNLP
Maria Marina, Nikolay Ivanov, Sergey Pletenev, Mikhail Salnikov, Daria Galimzianova, Nikita Krayko, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii
COALA: Numerically Stable and Efficient Framework for Context-Aware Low-Rank Approximation
NeurIPS
Uliana Parkina, Maxim Rakhuba
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
TTODLer-FM Workshop
Egor Petrov, Grigoriy Evseev, Aleksey Antonov, Andrey Veprikov, Pavel Plyusnin, Nikolay Bushkov, Stanislav Moiseev, Aleksandr Beznosikov
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization
IROS Oral
Gennady Sidorov, Malik Mohrat, Denis Gridusov, Rakhimov, Sergey Kolyubin
CHECK-MAT: Probing the Mathematical Reasoning and Rubric-Alignment of Vision-Language Models on Handwritten Solutions
EMNLP
Ruslan Khrulev
Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-Tailed
ICML
Savelii Chezhegov, Yaroslav Klyukin, Andrei Semenov, Aleksandr Beznosikov, Alexander Gasnikov, Samuel Horvath, Martin Takac, Eduard Gorbunov
Automatic Image Translation of Long Ancient Egyptian Texts for Augmented Reality Applications
ISMAR
Innokentiy Humonen, Maksim Golyadkin, Danil Kalin, Ilya Makarov
Curse of Slicing: Why Sliced Mutual Information is a Deceptive Measure of Statistical Dependence
NeurIPS
Alexander Semenenko, Ivan Butakov, Alexey Frolov, Ivan Oseledets
Correcting the LogQ Correction: Revisiting Sampled Softmax for Large-Scale Retrieval
RecSys
Kirill Khrylchenko, Vladimir Baikalov, Sergei Makeev, Artem Matveev, Sergei Liamaev
Sample complexity of Schrödinger potential estimation
NeurIPS
Nikita Puchkin, Iurii Pustovalov, Yuri Sapronov, Denis Suchkov, Alexey Naumov, Denis Belomestny
Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax
ICLR
Ivan Butakov, Alexander Semenenko, Alexander Tolmachev, Andrey Gladkov, Marina Munkhoeva, Alexey Frolov
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
NeurIPS Spotlight
Egor Cherepanov, Nikita Kachaev, Alexey K. Kovalev, Aleksandr I. Panov
Let It Go? Not Quite: Addressing Item Cold Start in Sequential Recommendations with Content-Based Initialization
RecSys
Anton Pembek, Artem Fatkulin, Anton Klenitskiy, Alexey Vasilev
LERa: Replanning with Visual Feedback in Instruction Following
IROS
Svyatoslav Pchelintsev, Maxim Patratskiy, Anatoly Onishchenko, Alexandr Korchemnyi, Aleksandr Medvedev, Uliana Vinogradova, Ilya Galuzinsky, Aleksey Postnikov, Alexey K. Kovalev, Aleksandr I. Panov
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
CVPR
Kamil Garifullin, Maxim Nikolaev, Andrey Kuznetsov, Aibek Alanov
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR
Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafał Poświata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Mariya Hendriksen, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Šuppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lù, Jordan Clive, Gayatri Krishnakumar, Anna Maksimova, Silvan Wehrli, Maria Tikhonova, Henil Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi, Wen-Ding Li, Alessia Borghini, Federico Cassano, Hongjin Su, Jimmy Lin, Howard Yen, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy
The Russian-focused embedders’ exploration: ruMTEB benchmark and Russian embedding model design
NAACL
Artem Snegirev, Maria Tikhonova, Anna Maksimova, Alena Fenogenova, Alexander Abramov
Does LLM dream of differential equation discovery?
NeurIPS
Elizaveta Ivanchik, Timur Bavshin, Alexander Hvatov
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation
ICCV
Vladislav Bargatin, Egor Chistov, Alexander Yakovenko, Dmitriy Vatolin
On Linear Convergence in Smooth Convex-Concave Bilinearly-Coupled Saddle-Point Optimization: Lower Bounds and Optimal Algorithms
ICML
Ekaterina Borodich, Alexander Gasnikov, Dmitry Kovalev
Learning geometry-aware recommender systems with manifold regularization
RecSys
Zaira Zainulabidova, Julia Borisova, Alexander Hvatov
Color Transfer with Modulated Flows
AAAI
Maria Larchenko, Alexander Lobashev, Dmitry Guskov, Vladimir Vladimirovich Palyulin
Color Conditional Generation with Sliced Wasserstein Guidance
NeurIPS Spotlight
Alexander Lobashev, Maria Larchenko, Dmitry Guskov
Hessian Geometry of Latent Space in Generative Models
ICML
Alexander Lobashev, Dmitry Guskov, Maria Larchenko, Mikhail Tamm
SMMR: Sampling-Based MMR Reranking for Faster, More Diverse, and Balanced Recommendations and Retrieval
SIGIR
Kiryl Liakhnovich, Oleg Lashinin, Andrey Babkin, Michael Pechatov, Marina Ananyeva
Sign Operator for Coping with Heavy-Tailed Noise in Non-Convex Optimization: High Probability Bounds Under (L_0, L_1)-Smoothness
NeurIPS
Nikita Kornilov, Philip Zmushko, Andrei Semenov, Mark Ikonnikov, Alexander Gasnikov, Aleksandr Beznosikov
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
ICRA
Sergey Linok, Tatiana Zemskova, Svetlana Ladanova, Roman Titkov, Dmitry Yudin, Maxim Monastyrny, Aleksei Valenkov
ToolReflection: Improving Large Language Models for Real-World API Calls with Self-Generated Data
ACL
Gregory Polyakov, Ilseyar Alimova, Dmitry Abulkhanov, Ivan Sedykh, Andrey Bout, Sergey Nikolenko, Irina Piontkovskaya
Autonomous AI-Driven Grid Protection: Sub-Cycle Fault Response via NPU-Optimized Neural Networks
SenSys
Aleksandr Kovalenko, Aleksey Evdakov, Galina Filatova, Andrey Yablokov, Aleksandr Bulashov, Ilya Makarov
A theoretical framework for self-supervised contrastive learning for continuous dependent data
ICDM
Alexander Marusov, Aleksandr Yugay, Alexey Zaytsev
ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization
NeurIPS
Dmitriy Shopkhoev, Ammar Ali, Magauiya Zhussip, Valentin Malykh, Stamatios Lefkimmiatis, Nikos Komodakis, Sergey Zagoruyko
You Do Not Fully Utilize Transformer's Representation Capacity
NeurIPS
Gleb Gerasimov, Yaroslav Aksenov, Nikita Balagansky, Viacheslav Sinii, Daniil Gavrilov
SPY: Enhancing Privacy with Synthetic PII Detection Dataset
NAACL SRW
Maksim Savkin, Timur Ionov, Vasily Konovalov
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity
ACL Spotlight
Yuri Kuratov, Mikhail Arkhipov, Aydar Bulatov, Mikhail Burtsev
Teach Old SAEs New Domain Tricks with Boosting
COLM
Nikita Koriagin, Yaroslav Aksenov, Daniil Laptev, Gleb Gerasimov, Nikita Balagansky, Daniil Gavrilov
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models
ICML
Alina Shutova, Vladimir Malinovskii, Vage Egiazarian, Denis Kuznedelev, Denis Mazur, Nikita Surkov, Ivan Ermakov, Dan Alistarh
Yambda-5B -- A Large-Scale Multi-modal Dataset for Ranking And Retrieval
RecSys
A. Ploshkin, V. Tytskiy, A. Pismenny, V. Baikalov, E. Taychinov, A. Permiakov, D. Burlakov, E. Krofto, N. Savushkin
RTD-Lite: Scalable Topological Analysis for Comparing Weighted Graphs in Learning Tasks
AISTATS
E. Tulchinskii, D. Voronkova, I. Trofimov, E. Burnaev, S. Barannikov
ProcrustesGPT: Compressing LLMs with Structured Matrices and Orthogonal Transformations
ACL
Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba