11, Pokrovsky boulevard.
Phone: +7 (495) 531-00-00 *27254
Ahidar-Coutrix A., Le Gouic T., Paris Q.
Probability Theory and Related Fields. 2020.
M. Borisyak, N. Kazeev.
Journal of Instrumentation. 2019. Vol. 14. No. 08. P. 1-8.
Frolov D., Nascimento S., Fenner T. et al.
Information Sciences. 2020. Vol. 512. P. 595-615.
Vetrov D., Izmailov P., Maddox W. J. et al.
In bk.: Proceedings of the 35th Uncertainty in Artificial Intelligence Conference (UAI-2019). 2019. P. 1-11.
In bk.: 34th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS 2019). IEEE, 2019. Ch. 36. P. 1-9.
The faculty trains developers and researchers. The programme has been created based on the experience of leading American and European universities, such as Stanford University (U.S.) and EPFL (Switzerland). Also taken into consideration when creating the faculty was the School of Data Analysis, which is one of the strongest postgraduate schools in the field of computer science in Russia. The wide range of elective courses will allow each student to create his or her own educational path. In the faculty, learning is based on practice and projects.
On August 30, 2015, the Summer School on Machine Learning in High Energy Physics wrapped up this year’s session. The school, which was held at the St. Petersburg Academic University, was organized by HSE in cooperation with the Yandex School of Data Analysis (SDA) and the Yandex Data Factory (YDF). This school is continuing cooperation between Yandex and CERN, which involves YDF and SDA researchers working together with experimental physicists on solving current problems in the field of physics. Many tasks require using machine learning approaches, which allow for greater accuracy and efficiency in these studies.
All of the school’s participants (about 50 people) were divided into two tracks, introductory and advanced. The main focus of the former was to provide an introduction to the principles of machine learning algorithms (decision trees, linear models, and neural networks), model evaluation and the use of classification for physical hypotheses testing; participants in this track also discussed comparison and overfitting in multidimensional distribution by means of machine learning. The advanced track focused on advanced algorithms (feature selection methods, ensemble methods, learning sample manipulations, genetic algorithms, hill climbing, rotation forest, dimensionality reduction, PCA, SVD, nonlinear methods, and deep learning approach) and on the application of algorithms in solving specific physical problems.
In addition to machine learning classes, the school included several overview lectures on various practical aspects of machine learning application in CERN experiments. Staff from the LHCb and CMS experiments spoke about optimization of online filtration of events through the use of machine learning, prediction of qualities in new particles, discovery of the Higgs boson and the search for nonstandard physical processes in experimental data. Online filtration of events in the early stages of events’ processing and reconstruction of the event structure using deep learning approaches in the LHCb experiment is a result of joint work carried out by CERN and HSE researchers.
Particular attention was paid to practical tasks. Seminars included a practical introduction to algorithms and tools that participants can use in further research. In addition to the seminars, a Kaggle competition was organized based on data from the COMET simulator experiment, which is being built in Japan. The aim of this experiment is to discover a brand new physical process that shows itself in neutrino-less conversion of a muon to an electron. Discovery of such conversion would change our knowledge of particle physics, since it contradicts the current standard model. A postdoctoral fellow at Imperial College London and a COMET participant spent two months this spring at an internship at SDA practicing the use of machine learning for searching particles (tracks) of a certain type (the form of tracks that allows the process that has taken place to be judged). As a result of this joint research, the efficiency of algorithms was increased from 83% to 99.9%. The competition was a perfect way to stimulate practical work – the participants were contending for first place on the last day until the final seconds.
Participant profile: physicists 65%, computer scientists 30%, other 5%
The school materials are available in a public repository.
This is the first time the school has been held, but feedback from the participants sounds optimistic:
Would you recommend this school to your friends and colleagues? Answer on a scale from 1 (never) to 5 (yes, of course!):
An interesting result of the school is the solution of the competition problem. This problem was taken from a real track recognition problem in the COMET experiment. It was being solved by Ewen Gillies, a postdoctoral fellow at Imperial College London, during his internship at the HSE Laboratory of Methods for Big Data Analysis under supervision by Alex Rogozhnikov. We simplified the real problem for the school and provided several useful hints, but the results of the school participants’ results are comparable with the quality of practical results. Congratulations to Sergey Korolev and Dmitry Petrov who took first place in our competition!