
NeurIPS 2022: Paper review #1

25 January 2023

Quantitative Research

G-Research were headline sponsors at NeurIPS 2022, in New Orleans.

ML is a fast-evolving discipline; attending conferences like NeurIPS and keeping up-to-date with the latest developments is key to the success of our quantitative researchers and machine learning engineers.

Our NeurIPS 2022 paper review series gives you the opportunity to hear about the research and papers that our quants and ML engineers found most interesting from the conference.

Here, Sebastian L, Quantitative Researcher at G-Research, discusses two papers from NeurIPS:

Focal Modulation Networks
Reconstructing Training Data from Trained Neural Networks

Focal Modulation Networks

Jianwei Yang, Chunyuan Li, Xiyang Dai, Lu Yuan, Jianfeng Gao

This paper proposes a new general-purpose image processing architecture, which combines ideas from image transformers and convolutional architectures into a new module the authors name focal modulation.

At a high level, the authors argue that the key idea is to swap the order of two steps relative to self-attention architectures: context aggregation, and the interaction between local features and the global context.

Focal modulation performs context aggregation first, in line with the authors’ aim of building a more compute-efficient architecture.

Hierarchical context aggregation is performed by an independent subnetwork. This consists of stacked depth-wise convolutions, the results of which are gated and summed into a modulation feature map. This map interacts with the query feature map in a pointwise fashion.
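To make the data flow above concrete, here is a minimal NumPy sketch of a single focal modulation step. It is a simplified illustration rather than the authors' implementation: the parameter names (W_q, W_z, W_g, W_h), the kernel sizes, the GELU non-linearity and the global average-pooled level are assumptions made for the example, and normalisation and the final output projection are left out.

```python
import numpy as np

def depthwise_conv2d(x, kernel):
    """Zero-padded 'same' depth-wise convolution (cross-correlation, as in DL libraries).
    x: (H, W, C), kernel: (k, k, C) with k odd; each channel has its own filter."""
    k = kernel.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (pad, pad), (0, 0)))
    H, W, _ = x.shape
    out = np.zeros_like(x)
    for i in range(k):
        for j in range(k):
            out += xp[i:i + H, j:j + W, :] * kernel[i, j, :]
    return out

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def focal_modulation(x, params):
    """x: (H, W, C) feature map. All parameter names are hypothetical."""
    q = x @ params["W_q"]        # query feature map, (H, W, C)
    z = x @ params["W_z"]        # initial context features, (H, W, C)
    gates = x @ params["W_g"]    # one gate per focal level plus one global, (H, W, L+1)

    modulator = np.zeros_like(z)
    # Hierarchical context aggregation: stacked depth-wise convolutions,
    # each level gated and summed into the modulator.
    for level, kernel in enumerate(params["kernels"]):
        z = gelu(depthwise_conv2d(z, kernel))
        modulator += gates[..., level:level + 1] * z
    # Assumed global level: average-pooled context, broadcast to every position.
    z_global = gelu(z.mean(axis=(0, 1), keepdims=True))
    modulator += gates[..., -1:] * z_global

    # Pointwise interaction between the query and the (projected) modulator.
    return q * (modulator @ params["W_h"])

rng = np.random.default_rng(0)
C, kernel_sizes = 4, [3, 5]   # illustrative sizes only
params = {
    "W_q": rng.standard_normal((C, C)),
    "W_z": rng.standard_normal((C, C)),
    "W_g": rng.standard_normal((C, len(kernel_sizes) + 1)),
    "W_h": rng.standard_normal((C, C)),
    "kernels": [rng.standard_normal((k, k, C)) / k**2 for k in kernel_sizes],
}
x = rng.standard_normal((8, 8, C))
y = focal_modulation(x, params)
print(y.shape)
```

Note that no pairwise token-to-token score matrix is ever formed: the context is aggregated once by the convolutions, and each position only interacts with its own modulator vector.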

The paper reports state-of-the-art results for the new focal modulation module, compared with modern self-attention-based vision architectures on classification and segmentation tasks, including ImageNet 1k and 22k, as well as on the COCO segmentation challenge.

The authors demonstrate on multiple examples that the modulation feature map naturally learns to attend to semantically meaningful image regions, such as the foreground or objects of interest.

Reconstructing Training Data from Trained Neural Networks

Niv Haim, Gal Vardi, Gilad Yehudai, Ohad Shamir, Michal Irani

This paper demonstrates that it is possible to reconstruct a large portion of the training data from the weights of a trained network, for networks trained on popular vision data sets such as CIFAR.

The images reconstructed from the trained network often match actual data set samples well and, in terms of pixels, reconstructions are impressively precise. The algorithm for reconstruction is based on a theoretical insight for homogeneous neural networks, which characterises the learned network weights as a critical point of a constrained optimisation problem.

This insight was first introduced in the deep learning literature in the study of the implicit bias of gradient flows. The authors observe that this result allows the trained weights of the network to be written as a linear combination of the gradients of the network output with respect to the weights, evaluated at each training data point in the data set.
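In symbols, the relation can be sketched as follows. The notation here is ours rather than the paper's: \Phi(\theta; x) denotes the network output, \theta the trained weights, y_i \in \{-1, +1\} the label of training point x_i (a binary classification setting is assumed), and \lambda_i non-negative coefficients. The first line is the stationarity condition; the second is the kind of residual one could minimise over candidate samples x_i and coefficients \lambda_i, with \theta held fixed at the trained weights. Any additional regularisation terms used in the paper are omitted.

```latex
\theta \;=\; \sum_{i=1}^{n} \lambda_i \, y_i \, \nabla_{\theta} \Phi(\theta; x_i),
\qquad \lambda_i \ge 0

\min_{\{x_i\},\, \{\lambda_i\}} \;
\Bigl\| \theta - \sum_{i=1}^{m} \lambda_i \, y_i \, \nabla_{\theta} \Phi(\theta; x_i) \Bigr\|_2^2
```

Only points with \lambda_i > 0 contribute to the sum, which is consistent with the reconstruction recovering some training samples rather than all of them.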

They use this to set up an appropriate minimisation problem to reconstruct all critical training samples simultaneously. The assumption that the network is homogeneous is satisfied, for example, by ReLU networks without bias terms, whose output scales by a power of α when the weights are scaled by α > 0.
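The homogeneity property is easy to check numerically. The following is a small sketch on a randomly initialised bias-free ReLU network; the layer sizes and the scaling factor are arbitrary choices for illustration. Scaling every weight matrix by α > 0 scales the output by α raised to the number of layers, while scaling a single layer (or the input) scales it by α.

```python
import numpy as np

def relu_net(weights, x):
    """Bias-free ReLU network: linear layers with a ReLU between them."""
    h = x
    for W in weights[:-1]:
        h = np.maximum(W @ h, 0.0)
    return weights[-1] @ h

rng = np.random.default_rng(0)
weights = [
    rng.standard_normal((5, 3)),
    rng.standard_normal((4, 5)),
    rng.standard_normal((1, 4)),
]
x = rng.standard_normal(3)
alpha = 2.0               # any positive factor works
depth = len(weights)      # number of weight layers

base = relu_net(weights, x)
all_scaled = relu_net([alpha * W for W in weights], x)
one_scaled = relu_net([alpha * weights[0]] + weights[1:], x)
input_scaled = relu_net(weights, alpha * x)

# Homogeneous of degree `depth` in all weights jointly ...
print(np.allclose(all_scaled, alpha**depth * base))
# ... and of degree one in any single layer and in the input.
print(np.allclose(one_scaled, alpha * base))
print(np.allclose(input_scaled, alpha * base))
```

Adding a bias term to any layer would generally break these identities, which is why the assumption is stated for networks without biases.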

This paper shows a surprising result based on a theoretical insight into the training dynamics of neural networks. The result has potential implications for the discussion on privacy preservation in neural networks, demonstrating it is possible to extract actual training samples from the weights of trained neural networks.


G-Research at NeurIPS 2022

We work on a very mature problem at G-Research, predicting financial markets, which means we need to stay at the cutting edge of what we do. That's why events like NeurIPS, where we are top-tier sponsors, are crucial for our business, as they bring together the best machine learning practitioners to present and discuss the latest research and innovation in ML.

We encourage our quant researchers and machine learning engineers to attend leading conferences in person to further develop their skills and stay abreast of the latest technological developments from some of the brightest minds in the industry. Additionally, a number of our talent acquisition team were on hand throughout the week to talk to attendees about what we do, including the various research and engineering roles we are currently hiring for, and we kept everyone fuelled with the help of our head barista.

As busy as we were inside NeurIPS as a headline sponsor, we also ran a number of events outside the conference hall during the week, not least the first ever G-Research boat party, held on a classic paddle steamer on the Mississippi. What better way to bring together like-minded people in New Orleans? We were delighted that so many people wanted to come along. As well as providing a unique networking opportunity, this event also gave us the chance to give our guests a flavour of what life is like at G-Research.

We pride ourselves on cultivating an environment where smart people come together to challenge themselves, enjoy their work, and achieve things as a team. And there's also plenty of opportunity for fun along the way. You know what they say about all work and no play.

Want to learn more about G-Research or meet us at a future event? Visit our website to find out more.
