Artificial Intelligence Paper Rankings

154 of 3489 papers
| # | Paper | Authors | Categories | Score | Match | Win% | Published |
|---|---|---|---|---|---|---|---|
| 1 | The Impossibility of Eliciting Latent Knowledge | Korbinian Friedl, Francis Rhys Ward +3 | | 1573 | 27 | 77.8% | Jun 10, 2026 |
| 3 | | | | 1553 | 15 | 80% | Jun 8, 2026 |
| 4 | ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecurity | Andrew Bo Liu, Samira Nedungadi +4 | cs.CY | 1552 | 18 | 83.3% | Jun 9, 2026 |
| 5 | PRISM: Recovering Instruction Sets from Language Model Activations | Gilad Gressel, Rahul Pankajakshan +4 | | 1535 | 23 | 87% | Jun 8, 2026 |
| 6 | Can AI Agents Synthesize Scientific Conclusions? | Hayoung Jung, Pedro Viana Diniz +6 | cs.CL, cs.CY | 1527 | 22 | 86.4% | Jun 9, 2026 |
| 8 | | | | 1515 | 25 | 80% | Jun 5, 2026 |
| 9 | | | | 1514 | 21 | 81% | Jun 9, 2026 |
| 11 | | | | 1505 | 20 | 75% | Jun 8, 2026 |
| 12 | | | | 1504 | 19 | 73.7% | Jun 9, 2026 |
| 13 | | | | 1500 | 24 | 83.3% | Jun 9, 2026 |
| 15 | | | | 1494 | 30 | 50% | Jun 8, 2026 |
| 16 | | | | 1493 | 21 | 76.2% | Jun 9, 2026 |
| 17 | | | | 1489 | 22 | 72.7% | Jun 8, 2026 |
| 18 | | | | 1480 | 29 | 55.2% | Jun 10, 2026 |
| 19 | | | | 1479 | 20 | 75% | Jun 9, 2026 |
| 20 | | | | 1479 | 20 | 75% | Jun 10, 2026 |
| 21 | | | | 1477 | 19 | 63.2% | Jun 9, 2026 |
| 23 | | | | 1465 | 24 | 75% | Jun 5, 2026 |
| 24 | | | | 1464 | 19 | 73.7% | Jun 8, 2026 |
| 25 | | | | 1463 | 27 | 48.1% | Jun 9, 2026 |
| 26 | | | | 1463 | 17 | 70.6% | Jun 9, 2026 |
| 27 | | | | 1461 | 24 | 75% | Jun 5, 2026 |
| 31 | ComplexConstraints and Beyond: Expert Rubrics for RLVR | Sushant Mehta, Liudas Panavas +1 | | 1454 | 23 | 73.9% | Jun 8, 2026 |
| 32 | Online Pandora's Box for Contextual LLM Cascading | Alexandre Belloni, Yan Chen +1 | econ.EM | 1452 | 19 | 68.4% | Jun 5, 2026 |
| 33 | | | | 1449 | 18 | 66.7% | Jun 8, 2026 |
| 34 | | | | 1447 | 16 | 50% | Jun 8, 2026 |
| 35 | Forecasting Future Behavior as a Learning Task | Mosh Levy, Yoav Goldberg +1 | | 1446 | 15 | 66.7% | Jun 9, 2026 |
| 38 | | | | 1435 | 24 | 70.8% | Jun 9, 2026 |
| 41 | | | | 1429 | 28 | 35.7% | Jun 8, 2026 |
| 42 | | | | 1428 | 20 | 60% | Jun 5, 2026 |
| 46 | Emergent alignment and the projectability of ethical personas | Guillermo Del Pinal, Youngchan Lee +2 | | 1417 | 18 | 66.7% | Jun 8, 2026 |
| 47 | | | | 1415 | 21 | 52.4% | Jun 8, 2026 |
| 48 | | | | 1415 | 15 | 53.3% | Jun 9, 2026 |
| 51 | Superficial Beliefs in LLM Decision-Making | Gabriel Freedman, Francesca Toni | | 1412 | 24 | 54.2% | Jun 9, 2026 |
| 52 | Search Discipline for Long-Horizon Research Agents | Adithya Srinivasan, Devesh Paragiri | | 1412 | 15 | 60% | Jun 9, 2026 |
| 53 | | | | 1411 | 22 | 54.5% | Jun 5, 2026 |
| 55 | | | | 1410 | 20 | 55% | Jun 8, 2026 |
| 58 | | | | 1405 | 20 | 70% | Jun 8, 2026 |
| 60 | | | | 1402 | 20 | 55% | Jun 8, 2026 |
| 61 | Off-Policy Evaluation with Strategic Agents via Local Disclosure | Kiet Q. H. Vo, Abbavaram Gowtham Reddy +3 | | 1401 | 19 | 57.9% | Jun 5, 2026 |
| 63 | Collaborative Human-Agent Protocol (CHAP) | Arsalan Shahid, Gordon Suttie +1 | cs.CL, cs.HC | 1397 | 17 | 64.7% | Jun 8, 2026 |
| 64 | | | | 1397 | 17 | 52.9% | Jun 8, 2026 |
| 65 | The Role of Feedback Alignment in Self-Distillation | Semih Kara, Oğuzhan Ersoy | | 1396 | 17 | 52.9% | Jun 9, 2026 |
| 66 | | | | 1394 | 20 | 60% | Jun 9, 2026 |
| 68 | | | | 1393 | 29 | 55.2% | Jun 10, 2026 |
| 69 | | | | 1393 | 21 | 57.1% | Jun 8, 2026 |
| 70 | | | | 1393 | 18 | 50% | Jun 8, 2026 |
| 72 | | | | 1391 | 19 | 57.9% | Jun 9, 2026 |
| 74 | | | | 1390 | 19 | 57.9% | Jun 9, 2026 |
| 75 | Correlation Is Not Enough: Embedding Human Metadata for Individual Causal Discovery | Suraj Biswas, Saurabh Gupta +1 | cs.CL, cs.PF | 1387 | 18 | 55.6% | Jun 8, 2026 |
| 78 | | | | 1376 | 17 | 58.8% | Jun 9, 2026 |
| 79 | | | | 1375 | 21 | 52.4% | Jun 8, 2026 |
| 84 | | | | 1369 | 14 | 57.1% | Jun 10, 2026 |
| 88 | | | | 1360 | 13 | 38.5% | Jun 10, 2026 |
| 90 | | | | 1359 | 16 | 43.8% | Jun 8, 2026 |
| 91 | | | | 1357 | 21 | 42.9% | Jun 9, 2026 |
| 92 | MoCA-Agent: A Market-of-Claims Code Agent for Financial and Numerical Reasoning | Abdelrahman Abdallah, AbdelRahim A. Elmadany +4 | cs.CE | 1355 | 19 | 47.4% | Jun 10, 2026 |
| 96 | | | | 1355 | 17 | 35.3% | Jun 9, 2026 |
| 98 | | | | 1353 | 23 | 43.5% | Jun 8, 2026 |
| 99 | | | | 1352 | 16 | 50% | Jun 9, 2026 |
| 101 | | | | 1343 | 17 | 35.3% | Jun 10, 2026 |
| 103 | | | | 1342 | 20 | 40% | Jun 8, 2026 |
| 105 | | | | 1340 | 22 | 40.9% | Jun 9, 2026 |
| 107 | | | | 1338 | 13 | 53.8% | Jun 9, 2026 |
| 108 | | | | 1337 | 15 | 53.3% | Jun 8, 2026 |
| 110 | Sim2Schedule: A Simulator-Guided LLM Framework for Autonomous Open-Pit Mine Scheduling | Mustavi Ibne Masum, Thiago Eustaquio Alves de Oliveira +1 | | 1334 | 21 | 42.9% | Jun 9, 2026 |
| 112 | | | | 1326 | 21 | 38.1% | Jun 9, 2026 |
| 117 | | | | 1319 | 27 | 29.6% | Jun 8, 2026 |
| 122 | | | | 1298 | 18 | 27.8% | Jun 8, 2026 |
| 123 | Accelerating NeurASP with vectorization and caching | Alexander Philipp Rader, Alessandra Russo | cs.LO | 1293 | 25 | 28% | Jun 9, 2026 |
| 124 | | | | 1292 | 20 | 30% | Jun 9, 2026 |
| 125 | | | | 1290 | 20 | 30% | Jun 9, 2026 |
| 127 | | | | 1289 | 19 | 36.8% | Jun 5, 2026 |
| 129 | | | | 1285 | 18 | 22.2% | Jun 10, 2026 |
| 132 | Declarative Skills for AI Agents in Knowledge-Grounded Tool-Use Workflows | M. Danish Lim, I. Danial Bin Sharudin +3 | cs.SE | 1284 | 21 | 33.3% | Jun 5, 2026 |
| 134 | | | | 1275 | 25 | 28% | Jun 5, 2026 |
| 136 | | | | 1267 | 24 | 29.2% | Jun 9, 2026 |
| 139 | | | | 1255 | 23 | 17.4% | Jun 10, 2026 |
| 140 | When Do Data-Driven Systems Exhibit the Capability to Infer? | Maximilian Poretschkin, Tabea Naeven | | 1250 | 17 | 29.4% | Jun 10, 2026 |
| 141 | Frequency-based Constrained Sampling for Interval Patterns | Djawad Bekkoucha, Abdelkader Ouali +1 | | 1248 | 35 | 17.1% | Jun 8, 2026 |
| 142 | | | | 1245 | 19 | 21.1% | Jun 10, 2026 |
| 143 | Towards Responsibly Non-Compliant Machines | Marija Slavkovik, Marie Farrell +4 | | 1244 | 18 | 22.2% | Jun 10, 2026 |
| 144 | | | | 1233 | 16 | 18.8% | Jun 10, 2026 |
| 148 | | | | 1213 | 22 | 18.2% | Jun 10, 2026 |
| 149 | | | | 1213 | 26 | 11.5% | Jun 9, 2026 |
| 151 | TOPSIS-RAD: Ranking According to Desires | Leonardo Fernandes Costa, Helder Gomes Costa +2 | econ.EM | 1204 | 27 | 14.8% | Jun 5, 2026 |