François Charton · Nov 10, 2022 · 12:13 PM UTC

François Charton

François Charton

@f_charton

10 Nov 2022

My paper Linear Algebra with Transformers was published in Transactions of Machine Learning Research (TMLR). This new version includes many new results and experiments. openreview.net/pdf?id=Hp4g7F… The source code should be available in a few days.

131

781

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

Transformers can be trained to solve a 132-years old open problem: discovering global Lyapunov functions. New paper on Arxiv (accepted in NeurIPS 2024), with @albe_alfa and @Amaury_Hayat arxiv.org/abs/2410.08304 1/8

123

671

73,347

François Charton · Mar 29, 2021 · 3:02 PM UTC

François Charton

@f_charton

29 Mar 2021

The source code, datasets and trained models for our paper "Learning Advanced Mathematical Computations from Examples", with @Amaury_Hayat and @GuillaumeLample, are now available at github.com/facebookresearch/…

GitHub - facebookresearch/MathsFromExamples: Source code, datasets and trained models for the paper...

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample -...

github.com

120

509

François Charton · Nov 5, 2024 · 1:33 PM UTC

François Charton

@f_charton

5 Nov 2024

Transformers for discrete optimisation problems 1- Train a model on candidate solutions 2- Use the model to generate more candidates 3- Improve the solutions with local search 4- Use the best candidates to fine tune the model 5- Iterate

Jordan Ellenberg @JSEllenberg

5 Nov 2024

New preprint up! "PatternBoost: Constructions in Mathematics with a Little Help from AI," with F. Charton, A.Z. Wagner, and G. Williamson: arxiv.org/abs/2411.00566

418

142,324

François Charton · Oct 10, 2022 · 12:59 PM UTC

François Charton

@f_charton

10 Oct 2022

My talk at Physics ∩ ML last week piped.video/watch?v=81o-Uiop… Recent results on transformers learning mathematical properties (instead of just memorizing and interpolating) at 40:00. And a first attempt at particle physics, and gluons, at 57:00

Maths with transformers

François Charton, Meta AI

youtube.com

343

François Charton · Nov 22, 2022 · 1:45 PM UTC

François Charton

@f_charton

22 Nov 2022

The source code for my two papers "Linear Algebra with Transformers" (TMLR) arxiv.org/abs/2112.01898 and "What is my math transformer doing?" (NeurIPS 2022 Math-AI Workshop) arxiv.org/abs/2211.00170) is now available at github.com/facebookresearch/… (with trained models and test sets)

314

François Charton · Dec 5, 2024 · 7:55 PM UTC

François Charton

@f_charton

5 Dec 2024

The code for our paper: Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers, with @albe_alfa and @Amaury_Hayat is available at github.com/facebookresearch/… We will be in NeurIPS: come see us at the poster session next Thursday at 5PM

GitHub - facebookresearch/Lyapunov: PyTorch original implementation of "Global Lyapunov functions:...

PyTorch original implementation of "Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers" (NeurIPS 2024). - facebookresearch/Lyapunov

github.com

311

41,220

François Charton · Jun 6, 2022 · 12:47 PM UTC

François Charton

@f_charton

6 Jun 2022

Transformers can be trained to compute the roots of polynomials f-charton.github.io/polynomi… It is often said that language models "cannot compute", evidence to the contrary is accumulating.

302

François Charton · Sep 2, 2025 · 11:46 AM UTC

François Charton

@f_charton

2 Sep 2025

I am joining Axiom Math, a seed-stage startup on AI for maths. I will lead discovery: AI for advancing math research. 6 years after Deep Learning for Symbolic Maths, our first paper with @GuillaumeLample, I am proud of the field's progress, and excited about what comes next.

289

33,597

François Charton · Dec 6, 2021 · 1:27 PM UTC

François Charton

@f_charton

6 Dec 2021

Transformers can be trained to solve problems of linear algebra (matrix transposition, addition, multiplication, inversion and eigenvalues) to very high accuracy. 1/4 Our new paper is on Arxiv: arxiv.org/abs/2112.01898

257

François Charton · Sep 13, 2024 · 10:47 PM UTC

François Charton

@f_charton

13 Sep 2024

Transformers solve an open problem in symbolic mathematics: discovering Lyapunov functions, joint work with Alberto Alfarano and @Amaury_Hayat. My talk in IAIFI today (starts at 5:00) piped.video/watch?v=yCzV97QN…

IAIFI Colloquium: Transformers meet Lyapunov: Solving a long-standing...

François Charton, Research Engineer, MetaFriday, September 13, 202...

youtube.com

IAIFI @iaifi_news

13 Sep 2024

Our first IAIFI Colloquium of the semester is starting now with @f_charton! "Transformers meet Lyapunov: Solving a long-standing open problem in mathematics." Watch live on YouTube: piped.video/live/yCzV97QNG8w…

245

47,255

François Charton · Sep 28, 2023 · 12:46 PM UTC

François Charton

@f_charton

28 Sep 2023

My talk in Harvard yesterday. Transformers for symbolic regression (11:30), theoretical physics (26:00), and results on explainability in linear algebra (39:00) and arithmetic (50:00) piped.video/Sc6k06wVX3s?si=H_vf…

219

29,623

François Charton · May 27, 2024 · 3:00 PM UTC

François Charton

@f_charton

27 May 2024

My talk at the IHES workshop: Mathematics for and by large language models piped.video/watch?v=k9xLg-3W… and the full seminar on carmin.tv carmin.tv/en/c/1539 featuring talks by @Amaury_Hayat, @KempeLab, Yiannis Vlassopoulos, Andrew Dudzik and @syhw

Francois Charton - Mathematics as a Translation Task - the Importance...

Many problems of mathematics can be set as translation tasks: probl...

youtube.com

179

198,513

François Charton · Jul 2, 2024 · 7:30 AM UTC

François Charton

@f_charton

2 Jul 2024

Looking for a postdoctorate student to work with me on applying transformers to open problems in mathematics and theoretical physics. This is an 18 month position, based in Paris. DM me if interested. metacareers.com/jobs/7714042…

173

53,066

François Charton · Mar 26, 2021 · 4:39 PM UTC

François Charton

@f_charton

26 Mar 2021

Deep language models can predict mathematical properties of differential systems. The final version of Learning advanced mathematical computations from examples, our ICLR 2021 paper, with @Amaury_Hayat and @GuillaumeLample, is on Arxiv arxiv.org/abs/2006.06462

Learning advanced mathematical computations from examples

Using transformers over large generated datasets, we train models to learn mathematical properties of differential systems, such as local stability, behavior at infinity and controllability. We...

arxiv.org

152

François Charton · Dec 16, 2024 · 4:14 PM UTC

François Charton

@f_charton

16 Dec 2024

One epoch is not all you need! Our paper, Emergent properties with repeated examples, with @KempeLab, won the NeurIPS24 Debunking Challenge, organized by the Science for Deep Learning workshop, @scifordl arxiv.org/abs/2410.07041

122

9,776

François Charton · Mar 25, 2024 · 1:26 PM UTC

François Charton

@f_charton

25 Mar 2024

The source code for my paper: Learning the greatest common divisor: explaining transformer predictions (arxiv.org/abs/2308.15594, ICLR 2024 spotlight) is now available on github.com/facebookresearch/….

117

42,776

François Charton · Oct 10, 2024 · 12:09 PM UTC

François Charton

@f_charton

10 Oct 2024

Math transformers learn better when trained from repeated examples. New paper with @KempeLab arxiv.org/html/2410.07041v1 On 3 problems, modular multiplication, GCD and eigenvalues, for the same training budget, models trained from smaller datasets achieve better performances. 1/5

120

28,219

François Charton · Nov 2, 2022 · 12:55 PM UTC

François Charton

@f_charton

2 Nov 2022

What is my math transformer doing? Three results on interpretability and generalization. When trained to solve numerical problems from examples, transformers learn some of the underlying maths, and can generalize way out of distribution. New preprint: arxiv.org/abs/2211.00170

116

François Charton · Jan 10, 2023 · 12:30 PM UTC

François Charton

@f_charton

10 Jan 2023

My presentation last Friday in the Collège de France (in French). Transformers learning maths, and recent results on explainability, and why transformers sometimes cheat. piped.video/RtB8kVCxJdw via @YouTube

110

31,655

François Charton · Apr 27, 2022 · 2:25 PM UTC

François Charton

@f_charton

27 Apr 2022

Our new paper on Symbolic Regression with @pa_kamienny @stephanedascoli @GuillaumeLample is now on Arxiv ! We achieve performance comparable to SOTA genetic algorithms on SRBench with Transformers, whose inference time is orders of magnitude lower! arxiv.org/abs/2204.10532 1/4

113

François Charton · Nov 12, 2024 · 4:47 PM UTC

François Charton

@f_charton

12 Nov 2024

How do transformers learn arithmetic tasks, such as GCD and modular sums and products? My talk in Collège de France on November 4th (in French, but the English subtitles are quite good). Thank you @wtgowers for inviting me to your seminar! piped.video/watch?v=e0jUi8W4…

107

36,480

François Charton · Aug 31, 2023 · 11:49 AM UTC

François Charton

@f_charton

31 Aug 2023

Transformers can learn to compute the greatest common divisor of two positive integers. They make deterministic predictions that can be fully explained. Training from a log-uniform distribution of operands achieves best results. My new paper is on arXiv: arxiv.org/abs/2308.15594

15,045

François Charton · Jan 14, 2022 · 4:39 PM UTC

François Charton

@f_charton

14 Jan 2022

Transformers can discover recurrence relations from sequences (aka IQ tests). New paper on symbolic regression, with @stephanedascoli @pa_kamienny and @GuillaumeLample

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

14 Jan 2022

Deep Symbolic Regression for Recurrent Sequences -- arxiv.org/abs/2201.04600 We show that transformers are great at predicting symbolic functions from values, and can predict the recurrence relation of sequences better than Mathematica. You can try it here: bit.ly/3niE5FS

François Charton · Feb 13, 2023 · 2:01 PM UTC

François Charton

@f_charton

13 Feb 2023

Leveraging maths to understand transformers. Transformers learning maths, or sometimes just pretending. A presentation at the NeurIPS 2022 MATH-AI workshop. neurips.cc/virtual/2022/work…

7,711

François Charton · Jan 14, 2025 · 2:20 PM UTC

François Charton

@f_charton

14 Jan 2025

A pure physics paper based on intuitions from AI experiments, expect more of these!

Kyle Cranmer @KyleCranmer

13 Jan 2025

I’m pretty excited about our new paper, which is a follow up to our last paper using AI to help solve a problem in theoretical particle physics. (With Lance, @f_charton, Matthias, Tianji, and @merz_garrett

3,058

François Charton · Oct 27, 2024 · 11:34 AM UTC

François Charton

@f_charton

27 Oct 2024

Our Lyapunov paper in the New Scientist. Thanks @stokel !

Chris Stokel-Walker @stokel

22 Oct 2024

An AI system has helped tackle a longstanding tough mathematical problem involving tools called Lyapunov functions. My latest for @newscientist newscientist.com/article/245…

5,485

François Charton · Jan 29, 2022 · 4:05 PM UTC

François Charton

@f_charton

29 Jan 2022

Great video about our work on symbolic regression! Thanks @ykilcher and @stephanedascoli

Yannic Kilcher 🇸🇨

@ykilcher

29 Jan 2022

📜Paper Video Time!📜Today I'm talking to Stéphane d'Ascoli (@stephanedascoli) about Deep Symbolic Regression for Recurrent Sequences. This model is given a sequence of numbers, like 1, 2, 3, 5, 8 and it figures out the *rule behind* the sequence. Insane🤯 piped.video/1HEdXwEYrGM

ALT Deep Symbolic Regression with the author

François Charton · Feb 16, 2024 · 12:54 PM UTC

François Charton

@f_charton

16 Feb 2024

I am looking for a research engineer (code and experiments), to work on scientific reasoning, with Julia Kempe, Yann Ollivier, and me metacareers.com/jobs/3577161…

7,375

François Charton · Sep 20, 2023 · 10:02 AM UTC

François Charton

@f_charton

20 Sep 2023

Presenting current work and recent results at Harvard on the 27th

6,092

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

Human mathematicians (masters students) achieve less than 10% accuracy on this task (vs more than 80% for our model). 8/8

2,549

François Charton · Mar 9, 2023 · 6:20 PM UTC

François Charton

@f_charton

9 Mar 2023

SALSA PICANTE: a machine learning attack on LWE with binary secrets. Transformers can be trained to recover secrets from public-key cryptosystems. New preprint arxiv.org/abs/2303.04178, with @CathyYLi, @JSotakova, @em_wenger, Mohamed Mahlou, Evrard Garcelon, and @KristinLauter.

4,946

François Charton · Apr 22, 2023 · 10:04 AM UTC

François Charton

@f_charton

22 Apr 2023

"A very perverse translation task," my work on transformers and linear algebra (arxiv.org/abs/2112.01898), discussed by mathematician Geordie Williamson (43:20 onwards) piped.video/trEY6c7eogQ

3,933

François Charton · Sep 18, 2024 · 12:34 PM UTC

François Charton

@f_charton

18 Sep 2024

Transformers for amplitudes, a first step towards using symbolic language models in theoretical physics - with @KyleCranmer @merz_garrett Tianji Cai, Matthias Wilhelm and Lance Dixon

Machine Learning: Science and Technology @MLSTjournal

17 Sep 2024

Great new work by @merz_garrett @KyleCranmer @f_charton et al @SLAClab @datascience_uw @AIatMeta @UCPH_Research-'Transforming the bootstrap:using #transformers...in planar N=4 super Yang–Mills theory'-iopscience.iop.org/article/1… #machinelearning #HEP #particlephysics #LHC #QCD #Higgs

2,387

François Charton · Feb 19, 2024 · 6:25 AM UTC

François Charton

@f_charton

19 Feb 2024

Transformers can learn string rewrites, if trained on a large and diverse set of rewrite rules. The number of rules matters, not the number of examples per rule. String rewrites form the basis of Markov algorithms. If transformers can learn them, they can learn any calculation.

Dylan Zhang

@dylan_works_

19 Feb 2024

I have been curious about the driving factors of generalization to unseen instructions. - so we therefore attempted to model this phenomenon with a symbolic task. - string rewrites. arxiv.org/pdf/2402.10891.pdf Happy to work with @f_charton and my reliable collaborator Justin on this study.

4,254

François Charton · Mar 4, 2020 · 8:35 PM UTC

François Charton

@f_charton

4 Mar 2020

Today is my first year in Facebook, and my first year working as a researcher. Thanks to all who made this possible, I had a blast!

François Charton · Sep 3, 2020 · 7:26 PM UTC

François Charton

@f_charton

3 Sep 2020

Today was my last day as a visiting entrepreneur in Facebook AI. I am amazed at how much I have learnt during those 18 months. Thanks everyone, I had a blast!

François Charton · Mar 26, 2025 · 11:35 AM UTC

François Charton

@f_charton

26 Mar 2025

Open sourcing Int2Int, a Python code base for AI for maths, with a special focus on arithmetic and number theory github.com/f-charton/Int2Int A user manual, and instructions on how to extend it, can be found here arxiv.org/abs/2502.17513

1,290

François Charton · Mar 29, 2021 · 3:02 PM UTC

François Charton

@f_charton

29 Mar 2021

The repository includes source code for data generation, model training, and evaluation of trained models. Since data generation is VERY compute-intensive, we have built 7 datasets, from 20 to 100M examples. We also include 7 pre-trained models.

François Charton · Jul 13, 2022 · 6:32 PM UTC

François Charton

@f_charton

13 Jul 2022

Transformers for cryptanalysis Our new paper, SALSA: Attacking Lattice Cryptography with Transformers, with @em_wenger, Mingjie Chen and @KristinLauter, is on ArXiv arxiv.org/abs/2207.04785

François Charton · Oct 10, 2024 · 12:09 PM UTC

François Charton

@f_charton

10 Oct 2024

For modular multiplication, models trained on 100 million different examples or more do not learn the task. Models trained on 25 or 50 million examples can achieve 100% accuracy. 2/5

19,904

François Charton · Oct 15, 2021 · 10:29 AM UTC

François Charton

@f_charton

15 Oct 2021

Thank you, #NeurIPS2021 for the outstanding reviewer award. This was my first time reviewing research papers, so it means a lot to me.

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

Global Lyapunov functions control the stability of dynamical systems: whether a system starting close to an equilibrium always stays close to the equilibrium (or diverges away). A famous case is the three-body problem: the stability of three celestial bodies under gravitation 2/8

2,322

François Charton · Dec 15, 2019 · 12:48 PM UTC

François Charton

@f_charton

15 Dec 2019

A very good introduction to our paper, which now looks embarrassingly simple... towardsdatascience.com/deep-…

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

Performance is greatly improved by adding a tiny number (0.03%) of easy and solvable "forward" examples (systems for which we have solutions) to the backward training set. Such "primed models" outperform state of the art methods by a large margin. 6/8

1,308

François Charton · Nov 25, 2024 · 8:29 PM UTC

François Charton

@f_charton

25 Nov 2024

Thank you for having me @KyleCranmer and @gary_shiu Research featured in the talk: discovering Lyapunov functions (9:55), PatternBoost: generative models in combinatorics (31:20), Scattering amplitudes (44:10), Arithmetic, repetition, and a few unpublished results (46:40)

datascience@uw @datascience_uw

25 Nov 2024

We had a great turnout for our inaugural AI for Science seminar with François Charton last week. If you missed it, check out the recording: mediaspace.wisc.edu/media/Xu… @KyleCranmer

2,734

François Charton · Jun 18, 2023 · 7:57 AM UTC

François Charton

@f_charton

18 Jun 2023

Videos from the three-day National Academies workshop on AI for mathematics, held last week.nationalacademies.org/event/…. A welcome change from the big claims over dubious test sets we have seen lately.

AI to Assist Mathematical Reasoning: A Workshop

A National Academies of Sciences, Engineering, and Medicine-appointed ad hoc committee will plan and organize a workshop that will bring together academic, industry, and government stakeholders to...

nationalacademies.org

3,195

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

We tested our models on sets of random dynamical systems, the stability of which is unknown, and could find new Lyapunov functions in 10 to 13% of the cases. (7/8)

2,082

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

In 1892, Lyapunov showed that global stability was guaranteed if a function V could be found, with a strict minimum at the equilibrium, infinite at infinity, and a gradient always pointing away from the system gradient. Unfortunately, he provided no method for finding V. 3/8

1,619

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

No general method exists for finding a Lyapunov function. To train our models, we introduce a backward generation technique that creates dynamical systems from their Lyapunov functions. These systems have a different distribution from the problems we actually want to solve. 4/8

1,417

François Charton · Jul 18, 2022 · 10:21 PM UTC

François Charton

@f_charton

18 Jul 2022

The source code for our ICML 2022 paper Deep Learning for Recurrent Sequences (arxiv.org/abs/2201.04600) is now available on github.com/facebookresearch/…. Spotlight: Wednesday 20, 16:50 ET Poster session: Wednesday 20, 18:30 ET @stephanedascoli @pa_kamienny @GuillaumeLample

François Charton · Jun 12, 2020 · 2:57 PM UTC

François Charton

@f_charton

12 Jun 2020

Our paper about leaning properties of differential systems with transformers is on Arxiv. Even on very advanced math computations, NLP models work surprisingly well. arxiv.org/abs/2006.06462

Learning advanced mathematical computations from examples

Using transformers over large generated datasets, we train models to learn mathematical properties of differential systems, such as local stability, behavior at infinity and controllability. We...

arxiv.org

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

12 Jun 2020

Could neural networks find alternatives to classical theories? We show that they can predict abstract mathematical properties of systems involving advanced notions like Fourier transforms, Jacobians, integration. 1/4 arxiv.org/abs/2006.06462 with @Amaury_Hayat and @f_charton

François Charton · Aug 5, 2025 · 11:18 AM UTC

François Charton

@f_charton

5 Aug 2025

Congratulations Jeremy! And long live AI4Maths !

Carnegie Mellon University @CarnegieMellon

4 Aug 2025

Replying to @CarnegieMellon

“The institute will focus on the mathematical components of these tasks and use the technologies to support mathematical reasoning and computation in all its applications,” said Jeremy Avigad, director of ICARM.

3,738

François Charton · May 13, 2024 · 2:17 PM UTC

François Charton

@f_charton

13 May 2024

Transformers for amplitude bootstrap, a hard problem in theoretical physics. A fun collaboration with SLAC, UWisconsin-Madison and Niels Bohr Institute #ai4science #ai4maths

Kyle Cranmer @KyleCranmer

13 May 2024

New paper out! We are using transformers to make progress in a cutting-edge problem in theoretical / mathematical physics. @GarrettMerz @datascience_uw, Lance Dixon & Tianji Cai @SLAClab, @f_charton & Niklas Nolte @AIatMeta, Mattias Wilhelm @UCPH_Research arxiv.org/abs/2405.06107

3,284

François Charton · Dec 4, 2019 · 2:48 PM UTC

François Charton

@f_charton

4 Dec 2019

Transformers work wonders on natural language. Given enough examples, they can translate without a dictionary. Why not consider mathematics as a language and problem solving as translation tasks? with @GuillaumeLample, arxiv.org/abs/1912.01412

François Charton · Oct 17, 2024 · 12:02 PM UTC

François Charton

@f_charton

17 Oct 2024

Yet, models trained on backward-generated data achieve good performance on test sets of polynomial systems that can be solved with numerical tools, despite having to generalize out-of-distribution. 5/8

1,341

François Charton · May 6, 2024 · 10:50 AM UTC

François Charton

@f_charton

6 May 2024

On my way to ICLR, want to talk about AI for maths and physics? Ping me!

2,389

François Charton · Jun 13, 2020 · 5:23 PM UTC

François Charton

@f_charton

13 Jun 2020

A very clear and insightful account of our paper: Deep Differential System Stability

Yannic Kilcher 🇸🇨

@ykilcher

13 Jun 2020

This LANGUAGE MODEL determines stability properties of differential systems, a task that usually requires multiple steps of high-level math and at least three grad students! 😮 watch the video here piped.video/l12GXD0t_RE @f_charton @Amaury_Hayat @GuillaumeLample @facebookai

François Charton · Jun 12, 2024 · 9:37 AM UTC

François Charton

@f_charton

12 Jun 2024

Human feedback, or an external verifier, applied to generated training data, can prevent model collapse.

Julia Kempe

@KempeLab

12 Jun 2024

How to leverage AI-synthesized data without catastrophic degradation? Rank-and-prune feedback, from humans or even weaker models, provably restores and even surpasses original performance! See arxiv.org/abs/2406.07515 @AIatMeta @feeelix_feng @dohmatobelvis @f_charton @yangpuPKU

2,112

François Charton · Jul 14, 2025 · 8:04 AM UTC

François Charton

@f_charton

14 Jul 2025

Two lessons I gave in March, for the Journées de Calcul Formel (Francophone Computer Algebra Days), in Luminy. Lesson one on AI and mathematical discovery (integration, differential system stability, combinatorics) piped.video/watch?v=ZTmltujo…

2,433

François Charton · Feb 18, 2024 · 9:55 AM UTC

François Charton

@f_charton

18 Feb 2024

Replying to @francoisfleuret

1- train loss: must drop (or you have a bug), fast (or lr too small), stable (or lr too large) 2- speed/mem/gpu usage: dataloaders fast (num_workers), all cores busy (batch size), no back and forth between cpu and gpu memory (bug) 3- can you use fp16, b16, or lower precision?

925

François Charton · Jul 22, 2021 · 5:45 PM UTC

François Charton

@f_charton

22 Jul 2021

Replying to @ilyasut

Maths are the free lunch, deep learning, just a recipe

François Charton · Feb 10, 2025 · 8:11 AM UTC

François Charton

@f_charton

10 Feb 2025

All hail the European labor laws!

1,549

François Charton · Nov 10, 2023 · 5:39 PM UTC

François Charton

@f_charton

10 Nov 2023

Can transformers learn Planar N=4 Supersymmetric Yang-Mills?

Kyle Cranmer @KyleCranmer

10 Nov 2023

Just finished an intense week @SLAClab with our small collaboration focusing on using AI to aid in state of the art theoretical physics calculations. 4 days, no talks, only blackboard, code, and results. @datascience_uw @AIatMeta

ALT six collaborators standing in front of a messy blackboard with many equations

2,274

François Charton · Mar 12, 2023 · 2:00 PM UTC

François Charton

@f_charton

12 Mar 2023

Most of the time, we use clarity as a proxy for truth, because we believe that we only express clearly what we understand well. Unfortunately, the self-supervised techniques used to train language models seem to do a much better job making them clear, than making them true.

1,346

François Charton · Apr 21, 2024 · 12:31 PM UTC

François Charton

@f_charton

21 Apr 2024

We are not there yet... Note the interesting failure pattern: the answer is wrong (should be 23), but the divisors of both operands, provided as justification, are correct, and a correct but irrelevant comment (23 is prime) is added for good measure.

1,932

François Charton · Dec 6, 2023 · 4:22 PM UTC

François Charton

@f_charton

6 Dec 2023

In NeurIPS next week, excited to chat about possible collaborations and new opportunities in AI for maths, physics and reasoning. DM me if you would like to meet.

1,257

François Charton · May 20, 2020 · 8:18 PM UTC

François Charton

@f_charton

20 May 2020

Our work with @GuillaumeLample is featured in Quanta magazine quantamagazine.org/symbolic-…

François Charton · Feb 23, 2020 · 1:10 AM UTC

François Charton

@f_charton

23 Feb 2020

Replying to @ameliovr @ylecun

Usually, because intermediate calculations become too complex: too long/deep formulas, too many branches. For integration, the Risch algorithm should handle 100% of cases, but it is very difficult to implement fully.

François Charton · Mar 30, 2024 · 7:40 AM UTC

François Charton

@f_charton

30 Mar 2024

Replying to @francoisfleuret

The book is very good, but very chinese, a westernized adaptation cannot work

1,953

François Charton · Dec 18, 2019 · 9:58 PM UTC

François Charton

@f_charton

18 Dec 2019

We are featured today on Popular Mechanics popularmechanics.com/science…

Facebook's Neural Net Can Solve This Differential Equation in One Second

Calculus just got a whole lot easier.

popularmechanics.com

François Charton · Aug 19, 2025 · 12:23 PM UTC

François Charton

@f_charton

19 Aug 2025

Replying to @francoisfleuret

Scaling up old ideas, with 10x the compute and a fancy acronym

1,138

François Charton · Aug 9, 2020 · 12:32 PM UTC

François Charton

@f_charton

9 Aug 2020

An interview about our work with @GuillaumeLample , published a while ago on the news feed of the American Mathematical Society (thank you @writesRCrowell ) ams.org/news?news_id=6207

François Charton · Dec 9, 2021 · 6:50 PM UTC

François Charton

@f_charton

9 Dec 2021

Can you know if a metabolic network has an equilibrium and which ? Transformers can ! We predict graph equilibriums and their associated flows with very high precision. 1/4 New paper on Arxiv arxiv.org/abs/2112.03588 with @Amaury_Hayat @RutgersCCIB @Rutgers_Camden

François Charton · Nov 5, 2024 · 6:21 PM UTC

François Charton

@f_charton

5 Nov 2024

Replying to @kchonyc @_angie_chen @samuel_stanton_

This is somewhat related to arxiv.org/abs/2406.07515 Adding some “truth signal” (local search, external verification) to generated data allows one to feed it back into the model, without triggering model collapse

890

François Charton · Jan 14, 2025 · 3:04 PM UTC

François Charton

@f_charton

14 Jan 2025

Expériences conduisant aux intuitions. Les résultats des expériences avec des transformers nous ont indiqué où regarder.

647

François Charton · Jun 25, 2023 · 10:36 AM UTC

François Charton

@f_charton

25 Jun 2023

Replying to @TaliaRinger

And, sometimes, you don't post the draft on Arxiv, because you think it is not ready, but you share your ideas with reputable researchers, just to find, later on, the same idea, with the exact same name, in a preprint from the same reputable researchers (and no acknowlegment).

2,133

François Charton · Oct 6, 2022 · 8:54 PM UTC

François Charton

@f_charton

6 Oct 2022

Our project on using transformers to understand the scattering amplitudes of gluons was awarded a grant!

Kyle Cranmer @KyleCranmer

6 Oct 2022

Woot! Lance Dixon (@SLAClab) & I (@UWMadPhysics @datascience_uw) have been awarded a grant from @doescience to use AI to take on a challenging problem theoretical particle physics. We will team up with @f_charton (@MetaAI) & Matthias Wilhelm (Niels Bohr) energy.gov/science/articles/…

François Charton · Jun 17, 2024 · 2:57 PM UTC

François Charton

@f_charton

17 Jun 2024

My talk in Amplitudes 2024, at @the_IAS, recent work on transformers for theoretical physics begins at 15:00. Thank you Nima Arkani-Hamed, Jacob Nourjaily, Hofie Hannesdottir and Sebastian Mizera for inviting me. piped.video/watch?v=kbkm61hW…

Transformers for Bootstrapperd Amplitudes- Francois Charton

Amplitudes 2024Topic: Machine learning the bootstrapped amplitude...

youtube.com

1,547

François Charton · Jan 14, 2020 · 6:13 PM UTC

François Charton

@f_charton

14 Jan 2020

We are featured on Engadget engadget.com/2020/01/14/face…

Facebook taught its AI to speak math - Engadget

I speak two languages, English and Bad English. My understanding of math is significantly worse. In fact, I had to redo Calculus 2A four different times in college in order to graduate, mostly...

engadget.com

François Charton · Oct 3, 2023 · 8:24 PM UTC

François Charton

@f_charton

3 Oct 2023

Replying to @AlbertQJiang @Yuhu_ai_ @jimmybajimmyba

We are still a niche, but a larger one.

305

François Charton · Mar 26, 2021 · 4:39 PM UTC

François Charton

@f_charton

26 Mar 2021

This new version includes baselines and experiments with out-of-distribution generalization, showing that models trained on systems of 2 to 5 equations can predict the properties of larger systems (6 equations, or longer expressions) with high accuracy.

François Charton · Jun 9, 2020 · 11:33 AM UTC

François Charton

@f_charton

9 Jun 2020

Same here, pdf was US letter, but the previewer I used to cut supplementary material converted it to A4 (european defaults). So, the margins are correct, the text is formatted as required, the only thing wrong is the amount of white space around it... seriously @neuripsconf ?

François Charton · Jul 14, 2025 · 8:04 AM UTC

François Charton

@f_charton

14 Jul 2025

Lesson 2 on AI for arithmetic, and maths for interpretability piped.video/watch?v=4PuJitS_…

1,101

François Charton · Jun 9, 2025 · 9:44 AM UTC

François Charton

@f_charton

9 Jun 2025

Replying to @aaron_defazio

In AI? AI for Science (maths, theoretical physics), this is the next frontier

630

François Charton · Oct 12, 2024 · 2:41 PM UTC

François Charton

@f_charton

12 Oct 2024

Diversity helps generalization. New paper with @dylan_works_ and Justin Wang

Dylan Zhang

@dylan_works_

11 Oct 2024

"Only-IF: Revealing the Decisive Effect of Instruction Diversity on Generalization" arxiv.org/pdf/2410.04717 We isolated 'instruction-following' ability (apart from complex reasoning like math) and designed various controlled experiments to show that -

1,491

François Charton · Mar 23, 2020 · 5:45 PM UTC

François Charton

@f_charton

23 Mar 2020

Code and datasets for our paper on Symbolic Mathematics are now available

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

23 Mar 2020

The code for our @iclr_conf paper, Deep Learning for Symbolic Mathematics, is now available in @PyTorch! We also provide our datasets and pretrained models Code: github.com/facebookresearch/… Paper: arxiv.org/abs/1912.01412

François Charton · Jul 9, 2020 · 11:50 AM UTC

François Charton

@f_charton

9 Jul 2020

Leveraging multivariate observations to discover causal/covariant features in a very noisy environment. My first research paper.

Jean-Rémi King

@JeanRemiKing

9 Jul 2020

Back-to-back regression: Disentangling the influence of correlated factors from multivariate observations. Our latest paper with @f_charton, David Lopez Paz & Maxime Oquab at @facebookai is now freely available at Neuroimage: sciencedirect.com/science/ar… Here's the summary thread ⤵️

François Charton · Apr 27, 2022 · 2:25 PM UTC

François Charton

@f_charton

27 Apr 2022

Our model is trained on a vast dataset of synthetic examples, and scales to input dimensions up to ten. After several million examples, the attention maps start to reveal intricate mathematical analysis : in the example f(x)=sin(x)/x below, we see Fourier-like patterns. 4/4

François Charton · Dec 6, 2019 · 9:54 PM UTC

François Charton

@f_charton

6 Dec 2019

Replying to @bhaveshshrima11 @GuillaumeLample

We selected functions from our test set that we could solve but Maple, Matlab and Mathematica could not. From this set, we chose the most "photogenic".

François Charton · Feb 23, 2020 · 9:45 AM UTC

François Charton

@f_charton

23 Feb 2020

Replying to @mkagenius @basit_ayantunde @facebookai

Table 4 on page 9 of our paper has a few. Table 7, page 11, has more interesting cases. There, the model was trained exclusively on functions SymPy can integrate. Yet it could solve problems that SymPy could not. So much for claims that we are overfitting.

François Charton · Nov 30, 2024 · 5:04 PM UTC

François Charton

@f_charton

30 Nov 2024

Replying to @NeelNanda5

Not quite mech interp, but would love to meet (in NeurIPS from Thursday to Sunday)

961

François Charton · Dec 6, 2021 · 1:27 PM UTC

François Charton

@f_charton

6 Dec 2021

Given enough examples, models trained on random matrices with independent and identically distributed (iid) coefficients (Wigner matrices) can predict with high precision. 2/4

François Charton · Mar 13, 2022 · 8:01 AM UTC

François Charton

@f_charton

13 Mar 2022

Replying to @ChrSzegedy @giffmana @GuillaumeLample

On par with a thermos bottle, according to other experts...

François Charton · Dec 17, 2022 · 11:05 AM UTC

François Charton

@f_charton

17 Dec 2022

French cuisine with Chat GPT, a three course réveillon for Christmas... (Experiment at your own risk) Pour commencer: the Brie en Croute à la Matelote

1,852

François Charton · Sep 6, 2025 · 6:47 AM UTC

François Charton

@f_charton

6 Sep 2025

Replying to @francoisfleuret

This is why you use Adam (or others): the average of successive directions is a better strategy than the local direction, which is influenced by local bumpiness, and warm-up: initial bounces can be misleading, let's make them shorter

497

François Charton · Feb 18, 2020 · 9:05 PM UTC

François Charton

@f_charton

18 Feb 2020

Presenting our paper in AISC tonight piped.video/8WmWwpflB7g

Deep Learning for Symbolic Mathematics | AISC

For slides and more information on the paper, visit https://aisc.ai...

youtube.com

François Charton · Apr 14, 2021 · 3:09 PM UTC

François Charton

@f_charton

14 Apr 2021

Replying to @AICoffeeBreak @Abel_TorresM @ChrSzegedy @GuillaumeLample @facebookai @MLStreetTalk

Actually this is how mathematics is done. A theorem usually begins as a wild guess with no guarantee of correctness, that one then tries to prove formally. And even when a counter exemple is found, the typical result is to change the wild guess a little, instead of rejecting it.

François Charton · Dec 21, 2021 · 6:14 AM UTC

François Charton

@f_charton

21 Dec 2021

Replying to @Panda31808732

Vu la période d'enquête, les 2d >6mois et 3d sont essentiellement des personnes âgées où à risque, non? (6 mois avant le 5 décembre c'est une deuxième dose avant le 5 juin, et une première avant le 25 avril).

François Charton · Apr 14, 2021 · 12:13 PM UTC

François Charton

@f_charton

14 Apr 2021

Replying to @Abel_TorresM @ChrSzegedy @AICoffeeBreak @GuillaumeLample @facebookai

There are two parts in problem solving: finding a candidate and proving it correct. Our paper addresses the first part. In a real-world application, verification would need to be implemented , but we believe this is a much simpler task.