Program

All times are shown in local Vienna time.

Monday, August 31

Paper sessions

Oral Session #1: Document analysis systems
Location: EI7
Time | ID | Paper | Authors
10:40 | 327 | Spatially-Grounded Gaussian-Prior Attention for Handwritten Mathematical Expression Recognition | Ibtissem Haj Ali, Harold Mouchère
11:00 | 154 | Bar-JEPA: Extracting Values from Bar Chart with Joint-Embedding Predictive Architecture | Alexander Epple, Poonam Poonam, Timo Ropinski
11:20 | 251 | GeoLogVQA: A Borehole Log Documents Dataset for Explicit and Implicit Spatial Reasoning | Stanislas Bagnol, Killian Barrere, Veronique Eglin, Elöd Egyed-Zsigmond, David Pitaval, Jean-Marie Côme
11:40 | 342 | AdaNav: Query-Adaptive Multi-Granularity Navigation for Long Document Understanding | Yiming Xu, Eric López, Artemis Llabrés, Maximiliano Hormazábal, Ernest Valveny, Dimosthenis Karatzas

Oral Session #2: Document forensics and provenance
Location: EI9
Time | ID | Paper | Authors
10:40 | 155 | Doc-Protector: A Self-Healing Approach for Digital Documents | Sudev Padhi, Archana Tiwari, Umesh Kashyap, Sk. Subidh Ali
11:00 | 304 | Temporal Modeling of Optically Variable Devices in Identity Documents | Glen Pouliquen, Joseph Chazalon, Guillaume Chiron, Oriol Ramos Terrades, Thierry Geraud, Ahmad Montaser Awal
11:20 | 321 | Improving Document Forgery Localization Robustness via Diverse JPEG Quantization Tables | Kylian Ronfleux Corail, Nicolas Sidere, Guillaume Bernard, Mickael Coustaty
11:40 | 144 | Adversarial Attacks on Online Handwriting using Salience-based Temporal Editing | Yataro Tamura, Brian Kenji Iwana, Jiseok Lee

Journal Track Session #1
Location: EI7
Time | ID | Paper | Authors
13:20 | 2025-560 | Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding | Anik De, Abhirama Subramanyam Penamakuri, Rajeev Yadav, Aditya Rathore, Harshiv Shah, Devesh Sharma, Sagar Agarwal, Pravin Kumar, Anand Mishra
13:50 | 2025-548 | Fidel: A Large-Scale Sentence Level Amharic OCR Dataset | Tunga Tessema Chamisso, Blessed Guda, Bereket Retta Adego, Carmel Prosper Sagbo, Gabrial Zencha Ashungafac, Assane Gueye

Journal Track Session #2
Location: EI9
Time | ID | Paper | Authors
13:20 | 2025-520 | Reviving Medieval Byzantine Seals: A Synthetic-to-Real Approach to Character Recognition | Gianluca Dalmasso, Patric Reineri, Mathieu Pscherer Noel, Ninon Achard, Beatrice Caseau, Laurence Likforman Sulem, Davide Cavagnino, Maurizio Lucenteforte, Attilio Fiandrotti, Victoria Eyharabide
13:50 | 2025-504 | DKDS: A Benchmark Dataset of Degraded Kuzushiji Documents with Seals for Detection and Binarization | Rui-Yang Ju, Kohei Yamashita, Hirotaka Kameko, Shinsuke Mori

Oral Session #3: Document image processing
Location: EI7
Time | ID | Paper | Authors
14:40 | 123 | P-HTG: One-Shot Handwritten Text Generation via Prototype-Guided Adaptive Gated Fusion | Heng Wang, Yiming Wang, Hongxi Wei
15:00 | 259 | Revisiting Structural Dependency in Autoregressive Multi-Task Table Recognition via Order-Independent Cell-Level Representations | Takaya Kawakatsu
15:20 | 62 | Improving MLLM Historical Record Extraction with Test-Time Image Augmentation | Taylor Archibald, Tony Martinez
15:40 | 379 | Handwriting Trajectory Recovery with Diffusion Models | Hiroki Nagamatsu, Shoji Toyota, Seiichi Uchida

Oral Session #4: Benchmarks and Datasets
Location: EI9
Time | ID | Paper | Authors
14:40 | 113 | JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding | Koki Maeda, Naoaki Okazaki
15:00 | 349 | ADV-FORMS: A Dataset of Form-Based Historical Documents With Benchmarks for Layout Analysis, HTR and OCR | Bernhard Ortbauer, Tobias Doppler, Pauline Schmidt, Lukas Schilcher, Wolfgang Göderle, Malte Rehbein, Alexander Werth, Roman Kern
15:20 | 34 | Structure-Aware Text Recognition for Ancient Greek Critical Editions | Nicolas Angleraud, Antonia Karamolegkou, Benoit Sagot, Thibault Clérice
15:40 | 100 | A Text Recognition Dataset from Sahidic Coptic Ancient Manuscripts | Fabio Quattrini, Carmine Zaccagnino, Costanza Bianchi, Silvia Cascianelli, Rita Cucchiara

Poster Session #1
Location: Hallway
ID | Paper | Authors
81 | RoWeR: RoBERTa Word error Rate estimator for OCRed texts | Tomás Osório, Henrique Lopes Cardoso
72 | A MambaVision-Based Cross-Modal Feature Enhancement Network for Scene Text Super-Resolution | Ruichang Zhu, Hongxi Wei, Bo Sun, Heng Wang
247 | DocCenter: Center and Corner Aware Representation for Robust Multi-Document Localization | Mengyuan Zhao, Kun Xu, Xin Cheng, Ting Li, Qiuman Tan, Xinyao Zhang
13 | EMBLEM: Enhancing Multi-script Table Detection through Masking | Dhruv Kudale, Udhay Brahmi, Ganesh Ramakrishnan
286 | Diffusion-Based Multi-View Reasoning for Scene Text Detection | Debayan Das Gupta, Shivakumara Palaiahnakote, Palash Ghosh, Umapada Pal, Cheng-Lin Liu
385 | MultiFOLD: A Multimodal Framework to correct OCR Lapses in cluttered Documents | Rajat Verma, Vriti Sharma, Manikandan Ravikiran, Rohit Saluja
250 | BinDiffuser: Learning Binary Style Priors to Guide Diffusion Models for Palm-Leaf Document Binarization | Salman K H, Chakravarthy Bhagvati
228 | Synthetic Training Data Generation for 3D Cuneiform Sign Recognition | Jan Philipp Bullenkamp, Florian Linsel, Lisa Wilhelmi, Hubert Mara
126 | GSMP: Geometry-Structured Masked Pretraining with Multi-Granularity Masking and Curriculum Learning for Geometric Problem Solving | Ziming Li, Jie Zhang, Xingxiang Zhou, Minzhi Zhang, Zhi Chen, Guanglai Gao, Xiangdong Su
117 | An Analysis of Lightweight Models for Document Image Machine Translation | Abantika Bose, Thomas Gorges, Lukas Hüttner, Linda-Sophie Schneider, Mathias Seuret, Fei Wu, Vincent Christlein
129 | Adaptive Hybrid Machine Translation for E-commerce: A Reinforcement Learning Approach to Arabic Localization | Ayman Hanafy, Farhan Khawar
280 | Token Selection Strategies for Automatic Summarization of Historical Documents | Merlin Streilein, Tobias Steiner, Andreas Fischer, Kaspar Riesen
351 | Doc2Doc: Structure-Aware Generative Rendering for Bi-Directional Document Translation | Fahad Alotaibi, Daulet Toibazar, Renad Almusaad, Ranya Alkahtani, Haneen Alhomoud, Asma Ibrahim, Yazeed Alharbi, Murtadha Aljubran, Pedro Moreno
211 | TKPE: Topic-based Evaluation for Keyphrase Prediction | Bingke Li, Jinghan Li, Jinhao Chen, Wu Zhuang, Yuxiang Zhang
142 | Active Reference Acquisition in Few-shot Font Generation | Shinnosuke Matsuo
370 | Synth-JDoc: Synthesizing a Japanese Document Image Dataset for OCR with Diverse Layouts and Embedded Images | Keito Sasagawa, Shuhei Kurita, Daisuke Kawahara
206 | Ambiguity-Controlled Handwritten Mathematical Expression Generation via Harmonized Dual-Conditional Guidance | Ryo Ishiyama, Takaya Kawakatsu
27 | Multi-Modal OMR for Heterogeneous Notations: A Collaborative Framework for Real-Time Symbolic-to-Immersive Mapping | Ankit Sinha, Atanu Saha, Chiranjoy Chattopadhyay, Rahul Kumar Ray
43 | Meaning Lies in Structure: Fine-Grained Table-Centric Document Semantic Parsing | Xuan Li, Mengfei Li, Jingtian Wei, Jialiang Dong, Raymond Wong
103 | Evaluating Feedback by Iterative Repair of Multi-Step Solution Documents | Tobias Lengfeld, Jakob Seitz, Radu Timofte
234 | Towards Scalable Knowledge Graph Extraction from Piping and Instrumentation Diagrams | Sanket Deshmukh, Apurva Gala, David Blom, Detlef Hohl
147 | Hierarchical Co-Embedding of Font Shapes and Impression Tags | Yugo Kubota, Kaito Shiku, Seiichi Uchida
92 | Identify, Locate, Link: End-to-End Key-Value Extraction from Document Images | Abdurrahman Said Gürbüz, Ahmed Nassar, Christoph Auer, Maksym Lysak, Lucas Morin, Matteo Omenetti, Tim Strohmeyer, Panagiotis Vagenas, Nikolaos Livathinos, Michele Dolfi, Peter Staar
125 | GraphVLM: Combining VLMs with GraphMLLM for Document Understanding | Shuai Li, Xiao-Hui Li, Haijie Yuan, Fei Yin, Lin-Lin Huang
318 | LiteDoc: Distilling Large Document Models into Efficient Task-Specific Encoders | Tayyab Raza, Syed Muhammad Taha Imam, Adrian Ulges, Ulrich Schwanecke, Momina Moetesum, Faisal Shafait
97 | CzechTopic: A Benchmark for Zero-Shot Topic Localization in Historical Czech Documents | Martin Kostelník, Michal Hradiš, Martin Dočekal
84 | Patram-Bench: A Comprehensive Multi-task, Multi-domain and Multilingual Benchmark for Indian Document Understanding | Anirudh Srinivasan, Pratyush Jena, Arya Topale, Venkat Kesav, Ravi Kiran Sarvadevabhatla
230 | DDD – A Diagnostic Dataset for Character Recognition and Detection on Ancient Egyptian Hieratic Characters and Words | Stephan Unter, Elena Hertel
339 | TimeAgent: From Matches to Memories — Timeline Summarization for Sports Analytics | Swagata Mukherjee, Samar Kumar Srivastava, Sriparna Saha
38 | Towards Breaking the Visual Perception Bottleneck for Geometry Problem Solving | Tianjiao Cao, Jiahao Lyu, Dongbao Yang, Weimin Mu, Zhou Yu
232 | Synthetic Data from Simulated Lecture Environments for Handwritten Content Extraction | Min Song, Kenny Davila
270 | Stringalign: Moving beyond summary statistics with a transparent Unicode-aware tool for evaluating automatic transcription models | Yngve Mardal Moe, Marie Roald
170 | DeChart: A Benchmark and Text-Enhanced Chart-to-Table Conversion Method with Multimodal LLMs | Chen-Yu Xie, Xiao-Hui Li, Fei Yin, Cheng-Lin Liu
200 | BullingerDB: A Dataset for Handwritten Text Recognition and Writer Retrieval | Marco Peer, Anna Scius-Bertrand, Patricia Scheurer, Andreas Fischer
93 | n-gram injection into transformers for dynamic language model adaptation in handwritten text recognition | Florent Meyer, Laurent Guichard, Yann Soullard, Denis Coquenet, Guillaume Gravier, Bertrand Coüasnon
136 | G2I: A Progressive Structure-to-Detail Curriculum Training Strategy for Handwritten Mathematical Expression Recognition | Minzhi Zhang, Xingxiang Zhou, Ziming Li, Jie Zhang, Zhi Chen, Xiangdong Su
137 | Few-Shot Writer Adaptation via Multimodal In-Context Learning | Tom Simon, Pierrick Tranouez, Stephane Nicolas, Clement Chatelain, Thierry Paquet
145 | Specialized HTR vs Vision-Language Models: Evaluating DANIEL and Fine-Tuned Qwen on Historical Documents | Gabriel Frossard, Franck Gechter
203 | A Millennium of Arabic Manuscripts in Three Styles: A Line-Level OCR Benchmark for Naskh, Taliq, and Nastaliq | Maxim Novopoltsev, Ruslan Murtazin, Andrey Sakhovskiy, Emilia Bojarskaja, Vladimir Kokh, Ivan Ulitin, Botirjon Abdullayev, Khamidulla Aminov, Masudkhon Ismoilov, Semen Budennyy
330 | DiffusionRec: Recognition-Guided Diffusion for Content-Aware Urdu Handwriting Generation | Saima Kausar, Ayesha Amjad, Ahmad Sarmad Ali, Momina Moetesum, Adnan Ul Hasan, Faisal Shafait
355 | Online Urdu Text-Line Recognition by Bridging Stroke Dynamics and Offline Representations | Ali Hussain, Rafay Ahmad, Momina Moetesum, Adnan Ul-Hasan, Faisal Shafait
28 | Enhancing IMU-Based Online Handwriting Recognition via Contrastive Learning with Zero Inference Overhead | Jindong Li, Dario Zanca, Vincent Christlein, Tim Hamann, Jens Barth, Peter Kämpf, Björn Eskofier
239 | Reference-Free Handwritten Japanese Character Generation via CLIP-Conditioned Diffusion Models | Koki Fujita, Hideaki Yajima, Chee Siang Leow, Hiromitsu Nishizaki
141 | METATR: A Multilingual, Evolving Benchmark for Automatic Text Recognition | Mélodie Boillet, Solène Tarride, Christopher Kermorvant
65 | Writer Retrieval at Scale | Tim Raven, Tim Hallyburton, Gernot A. Fink
252 | Benchmarking Information Retrieval for Large Archives of Historical Documents | Tobias Steiner, Merlin Streilein, Andreas Fischer, Kaspar Riesen
303 | RAGXDoc: Structured Knowledge-guided Retrieval and Explainable Re-ranking for Academic Documents | Dipendra Sharma Kafle, Esma Talhi, Mickael Coustaty, Antoine Doucet
316 | Recent Advances in Information Extraction from Historical Archival Records | Arthur Matei, Tim Hallyburton, Lukas Hennies, Christoph Rass, Gernot A. Fink
16 | An Exploratory Study of Text-to-Image Generation for Query-by-Example Retrieval of Historical Document Images | Melissa Cote, Alexandra Branzan Albu
31 | LMS-Retrieval: Layout-Aware, Modality-Aware, Structure-Aware Document Retrieval | Man Qin, Tim French, Wei Liu

Tuesday, September 1

Paper sessions

Oral Session #5: Graphics recognition
Location: EI7
Time | ID | Paper | Authors
09:00 | 161 | Optical Music Recognition for Real-World Manuscripts with Synthetic Data | Jiří Mayer, Martina Dvořáková, Vojtěch Dvořák, Markéta Herzánová Vlková, Filip Jebavý, Pavel Pecina, Samuel Šomorjai, Petr Žabička, Jr., Jan Hajič
09:20 | 204 | EAGLE: Explicit Anchoring and Graph Reasoning with Diagram Structure Priors for Multimodal Geometry Problem Solving | Jie Zhang, Xiangren Wang, Ziming Li, Minzhi Zhang, Xingxiang Zhou, Zhi Chen, Guanglai Gao, Xiangdong Su
09:40 | 89 | Stroke-Level Connectivity Verification: Grounding Vision-Language Models Against Topology Hallucination in Diagram Understanding | Abdullah Ibne Hanif Arean, Niamul Hassan Samin, Md Arifur Rahman, Renu Akter Sweety, Juena Ahmed Noshin, Md Ashikur Rahman
10:00 | 174 | CPAgent: A Tool-Augmented Agentic Framework for Chart Parsing | Chen-Yu Xie, Xiao-Hui Li, Boran Wang, Fei Yin, Cheng-Lin Liu

Oral Session #6: Handwriting recognition
Location: EI9
Time | ID | Paper | Authors
09:00 | 347 | InkTree: A Unified Representation of Structured Online Ink | Jakob Seitz, Tobias Lengfeld, Radu Timofte
09:20 | 30 | Democratizing the medieval English legal tradition | Michael Zhang, Elise Wang, Charlotte Whatley, Seth Strickland, Dylan Bannon
09:40 | 102 | Ad-hoc Personalization of Offline Handwriting Recognition Using Style Transfer | Said Yasin, Torsten Zesch
10:00 | 159 | Character Template Representation for Confidence Learning in Handwritten Text Recognition | Yangyang Liu, Heng Zhang, Fei Yin, Cheng-Lin Liu

Oral Session #7: Historical document analysis
Location: EI7
Time | ID | Paper | Authors
10:40 | 114 | Blind Image Decomposition for Recovering Overlapping Text Layers on Palimpsests | Baharan Pourahmadi, Panagiotis Leontaridis, Paolo Scattolin, Mads Toudal Frandsen
11:00 | 175 | Quality Prediction for Large Scale HTR – Confidence Is All You Need | Erik Lenas, Viktoria Lofgren, Olof Karsvall
11:20 | 380 | Beyond Labels: Visual Invariance in Self-Supervised Learning for Aramaic Incantation Bowls | Nour Atamni, Boraq Madi, Islam Amar, Raid Saabni, Jihad El-Sana
11:40 | 98 | Decipherment of Oracle Bone Inscription via Component Deconstruction and Alignment | Yuanhui Lin, Hetao Wu, Qingju Jiao, Yongge Liu, Da-Han Wang

Oral Session #8: Layout Analysis
Location: EI9
Time | ID | Paper | Authors
10:40 | 88 | REPLICA: An Agentic Framework for Visually Faithful Document Reconstruction | Raghuveer R, Anirudh Srinivasan, Venkat Kesav Venna, Shanmukha Sreevatsa Tallapragada, Aryan Jain, Sahithi Kukkala, Ravi Kiran Sarvadevabhatla
11:00 | 169 | HKGC: A Hierarchical Knowledge Graph Construction Framework for Structure-Aware RAG | Yingxin Guan, Jian Xing, Zhaohua Zheng, Zhaofu Zeng, Bai Lei, Fanchen Meng, Haitao Guo
11:20 | 183 | Complex Layout Classification in the Wild: A Low-Resource Approach with Layout-Preserving Augmentations | Sharva Gogawale, Iddo Hakim, Gal Grudka, Mohammad Suliman, Omer Ventura, Daria Shapira, Berat Barakat, Nachum Dershowitz
11:40 | 289 | GRaF-Net: a Multi-Branch Gated Residual Architecture for Floor Plan Semantic Segmentation | Axel De Nardin, Silvia Zottin, Claudio Piciarelli, Gian Luca Foresti

Journal Track Session #3
Location: EI7
Time | ID | Paper | Authors
14:40 | 2025-565 | TableSeq: Unified Generation of Structure, Content, and Layout | Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet
15:10 | 2025-561 | PILOT: A Promptable Interleaved Layout-aware OCR Transformer | Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet
15:40 | 2025-539 | Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation | Lukas Arzoumanidis, Julius Knechtel, Jan-Henrik Haunert, Youness Dehbi
16:10 | 2025-524 | AIKON: A Modular Computer Vision Platform for Historical Corpora | Ségolène Albouy, Somkeo Norindr, Paul Kervegan, Fouad Aouinti, Rémy Delanaux, Clara Grometto, Robin Champenois, Stavros Lazaris, Alexandre Guilbaud, Matthieu Husson, Mathieu Aubry

Journal Track Session #4
Location: EI9
Time | ID | Paper | Authors
14:40 | 2025-482 | A Novel Domain Adaptation Based Pipeline for Character Classification and Handwritten Recognition | Florent Imbert, Simon Corbillé, Hui Han, Elisa H. Barney Smith
15:10 | 2025-479 | Unsupervised Document and Template Clustering using Multimodal Embeddings | Phillipe R. Sampaio, Helene Maxcici
15:40 | 2025-468 | HQ-Font: Few-shot Font Generation via Transferring Hierarchical Quantization Styles | Anna Zhu, Wei Pan, Guan Li, Hongyi Cai, Kenji Brian
16:10 | 2025-403 | Predicting Text Recognition Word Error Rate of Image Documents Without Ground Truth Transcripts | Enrique Vidal, Alejandro H. Toselli

Wednesday, September 2

Paper sessions

Oral Session #9: Recognition of tables and formulas
Location: EI7
Time | ID | Paper | Authors
10:40 | 197 | Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding | Anmol Gulati, Sahil Sen, Waqar Sarguroh, Kevin Paul
11:00 | 209 | Beyond the Page Break: An LLM-based Solution for Cross-Page Table Reconstruction | Yu Tang, Hongwei Li, Yixuan Cao, Ping Luo
11:20 | 306 | PatentME: A Dataset and Reference-Free Post-OCR Verification Task for Printed Mathematical Expression Recognition | François Wieckowiak, Véronique Eglin, Tony Bonnet, Stéphane Bres, Laëtitia Rousseau
11:40 | 393 | Multilingual Table Recognition: A Benchmark Dataset and A Local–Global Hybrid Model | Nam Tuan Ly, Atsuhiro Takasu, Masaki Nakagawa

Oral Session #10: Text and symbol recognition
Location: EI9
Time | ID | Paper | Authors
10:40 | 309 | What Can Languages of the Global South Teach Each Other? | Achyuth P, Kahaan Shah, Chetan Arora
11:00 | 138 | A Benchmark of State-Space Models vs. Transformers and BiLSTM-based Models for Historical Newspaper OCR | Merveilles Agbeti-Messan, Thierry Paquet, Pierrick Tranouez, Clement Chatelain, Stephane Nicolas
11:20 | 146 | Evolution-Guided Diffusion for Oracle Bone Script Decipherment | Haotian Chen, Hetao Wu, Qingju Jiao, Yongge Liu, Da-Han Wang
11:40 | 225 | Towards Universal Khmer Text Recognition | Marry Kong, Rina Buoy, Sovisal Chenda, Nguonly Taing, Masakazu Iwamura, Koichi Kise

Poster Session #2
Location: Hallway
ID | Paper | Authors
83 | UniLipi: A Unified Multi-Script OCR for Historical Indic Manuscripts | Tathagata Ghosh, Sai Madhusudan Gunda, Simran Singh Sandral, Ravi Kiran Sarvadevabhatla
120 | Learning Diachronic Representations of Ancient Greek Letterforms | John Pavlopoulos, Spyros Barbakos, Lavinia Ferretti, Dionysis Voulgarakis, Asimina Paparrigopoulou, Maria Konstantinidou, Giuseppe De Gregorio, Isabelle Marthot-Santaniello, Paraskevi Platanou, Holger Essler
157 | Automatic Layout Detection in Historical Civil Records Using Deep Object Detection | Wissam Alkendi, Franck Gechter, Laurent Heyberger, Christophe Guyeux
261 | Leveraging Morphology for Historical Script Metrological Analysis | Malamatenia Vlachou Efstathiou, Raphaël Baena, Dominique Stutzmann, Mathieu Aubry
301 | One Model, Many Guidelines: Instruction Fine-Tuning for Historical Named Entity Recognition | Nam Nguyen, Emanuela Boros, Adam Jatowt, Ahmed Hamdi, Mickael Coustaty, Antoine Doucet
10 | Robust Interpretation of Historical Documents in Knowledge Graphs Through Query Inference and Execution | Sebastià Nicolau Orell, Adrià Molina Rodríguez, Oriol Ramos Terrades, Josep Lladós Canet
17 | Conversational Retrieval and On-the-Fly Knowledge Modeling from Historical Documents | Paula Font Solà, Adrià Molina Rodríguez, Josep Lladós Canet
21 | Revisiting how we access to historical archives: Auditing Gender Stereotypes and the Division of Labour in the Analysis of Historical Photography Collections | Francesc Net Barnes, Adrià Molina Rodríguez, Sofia Llacer-Caro, Lluis Gomez Bigorda
55 | HME-Leibniz: A Multi-level Mathematical Expression Dataset from Leibniz's Manuscripts | Yejing Xie, Ze Qian, Yunfan Li, David Rabouin, Harold Mouchère
66 | EpiSAM: Character Segmentation in Challenging Stone Inscriptions | Arnav Sharma, Pratyush Jena, Amal Joseph, Ravi Kiran Sarvadevabhatla
74 | Parameter-Efficient and Adaptive Fine-Tuning for Long-Tailed Ancient Characters Recognition | Hao Wang, Aouaidjia Kamel, Konstantinos Kotropoulos, Chongsheng Zhang
82 | Evaluating Vision-Language Models on Historical Postcards | Matthieu Pelingre, Salvatore Tabbone
256 | Text region detection in historical astronomical diagrams | Zeynep Sonat Baltaci, Raphael Baena, Fei Meng, Somkeo Norindr, Florence Somer, Matthieu Husson, Mathieu Aubry
312 | Bridging the Gaps: Learning to Estimate Missing Text in Fragmentary Greek Inscriptions | Silvia Zottin, Axel De Nardin, Valentina Mignosa, Maddalena Zunino, Gian Luca Foresti
320 | Angkorian-KSI: A Multi-Task Benchmark for Khmer Stone Inscription Analysis | Nimol Thuon, Jun Du, Ranysakol Thuon, Panhapin Theang
391 | Vision-Language Model based Transfer Learning for Historical Document Recognition | Yifan Huang, Liangrui Peng, Tianqi Zhao, Di Wu, Kemeng Zhao, Shuo Li, Zhiyu Li, Yuyang Li
282 | HIDRA: Hierarchical Ink-aware Dual-granularity Retrieval Architecture for Historical Fragments | Jihad Al Akl, Chady Abou Jaoude, Zahi Al Chami, Marianne Abi Kanaan, Abdallah Makhoul
300 | Theatre Chapbooks At Scale: A Statistical Comparative Analysis of Typography | Diego Belzarena, Seginus Mowlavi, Paula Casariego Castiñeira, Alejandra Ulla Lorenzo, Gregory Randall, Jean-Michel Morel
298 | Generalized Open-set Single-shot Character Recognition on Ancient Egyptian Hieratic Characters | Stephan Unter, Chang Liu, Elisa Barney Smith
208 | Automated Character-Level Annotation for Historical Nom Documents via an Iterative Self-Updating Radical-Aware Recognizer | Cuong Nguyen, Khoa Nguyen Tran, Ngoc Tuan Nguyen, Hung Tuan Nguyen, Nam Tuan Ly, Masaki Nakagawa
213 | MaPE-Former: A Mask-Aware Position Encoding Network for Chinese Character Image Restoration | Wei Wei, Xinrui Liu, Jianxin Zhang, Xiaodong Duan
153 | From Pixels to Structure: Lightweight Vision-Language Models for Document OCR and Structured JSON Extraction | Uddipan Basu Bir, Vincent Christlein, Andreas Maier, Mathias Zinnen
57 | Towards Non-Latin Character and Layout Personalization for Enhanced Readability | Rina Buoy, Dylan Berkamp Fouepe Dongmo, Vesal Khean, Simone Marinai, Koichi Kise
314 | Efficient Table QA via TableGrid Navigation and Progressive Inference Prompting | Amritansh Maurya, Navjot Singh, Mohammed Javed, Omar Moured
231 | Can VLMs Understand Handwritten Mathematical Documents? | Shree Mitra, Ajoy Mondal, C. V. Jawahar
85 | GRACE: Gradient-Regulated Approach for Consistent Explanations | Sayantan Basu
311 | GLiDRE: Generalist Lightweight Model for Document-level Relation Extraction | Robin Armingaud, Romaric Besançon
260 | From Chunks to Graphs: Training-Free Multimodal Late Interaction for Document Understanding | Ayush Lodh, Souparni Mazumder, Sanket Biswas, Josep Llados, Nisha Singh
387 | Figures as Evidence: Multi-Image Scientific Generation | Jawad Ibn Ahad, Mritunjoy Chakraborty, Fuad Rahman, Sifat Momen, Shafin Rahman, Nabeel Mohammed
64 | Prediction of Grade, Gender, and Academic Performance of Children and Teenagers from Handwriting Using the Sigma-Lognormal Model | Adrian Iste, Kazuki Nishizawa, Chisa Tanaka, Andrew Vargo, Anna Scius-Bertrand, Andreas Fischer, Koichi Kise
216 | Hierarchical Stroke-Level Clustering and Step-Level Segmentation for Automatic Scoring of Geometric Construction Answers with an Electronic Drawing Compass | Thanh-Nghia Truong, Hung Tuan Nguyen, Nam Tuan Ly, Yoichi Tsuchida, Hiroshi Miyazawa, Tomo Asakura, Masamitsu Ito, Toshihiko Horie, Fumiko Yasuno, Masaki Nakagawa
227 | Towards Khmer Scene Document Layout Detection | Marry Kong, Rina Buoy, Sovisal Chenda, Nguonly Taing, Masakazu Iwamura, Koichi Kise
258 | Bounding Box Label Propagation for Re-Annotation of Document Layout Analysis Datasets | Nick Jochum, Tobias Alt-Veit, Christian Schön, Alexander Lück, René Schuster, Didier Stricker
73 | Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering | Marco Pintore, Maura Pintor, Battista Biggio, Dimosthenis Karatzas
173 | Active Learning for Cascaded Object Detection: Balancing Coverage and Uncertainty in Table Extraction Pipelines | Eliott Thomas, Mickael Coustaty, Aurélie Joseph, Gaspar Deloin, Vincent Poulain D'andecy, Jean-Marc Ogier
242 | FastTab: A Fast Table Recognizer with a Tiny Recursive Module and 1D Transformers | Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet
192 | ConRTF: Edge-Constrained Boundary Distribution Refinement for Realtime TransFormer Table Structure Recognition | Eliott Thomas, Tri-Cong Pham, Mickael Coustaty, Aurélie Joseph, Gaspar Deloin, Vincent Poulain D'andecy, Jean-Marc Ogier, Antoine Doucet
156 | SCALES: Scalable Context-Aware Learning with Expert Specialization for Incremental Multilingual Text Recognition | Qing Lin, Xiaohui Li, Heng Zhang, Fei Yin, Chenglin Liu
151 | Multi-Modal Deep Learning for Medieval Inscription Recognition: A Study of Saint Sophia Cathedral Graffiti | Aram Karimi, Jonathan Westine, Gunnar Almevik
139 | AOSSig4000: A Real-World Chinese Handwritten Signature Dataset with Diverse Background Noise and Pixel-Level Annotations | Xunhui Qin, Desheng Wang, Kunpeng Gui, Fang Shi, Zhonghao Shen, Du Zhou, Ke Liu, Peirong Zhang, Yang Xue, Lianwen Jin
91 | Online Signature Verification Using Augmented Path Signature and T-Mamba | Ruiling Li, Danyu Yang
23 | Master Forgers, Fragile Detectors? A Forensic Study of Vision-Language Models for Signature Verification | Kumari Priya, Bibek Das, Chandranath Adak, Soumi Chattopadhyay
122 | Preserving High-Fidelity Character Structure in Handwritten Text Generation via Multimodal Guidance | Heng Wang, Yiming Wang, Hongxi Wei
41 | LLMSFL: LLM-Driven Smart Feedback Loop System for Target Document Generation | Zezhong Guo, Yongjian Zhang
167 | Arbitrary Glyph and Multi-Resolution Font Generation with Mixed Content Representations | Xiaoge Chen, Shilin Li, Leilei Yao, Anna Zhu
356 | MIDAS: Multi-LLM Iterative Data-Adaptive Summarization | Karen Lee, Dhanashree Balaram, Seojun Shon, Umair Rasheed
384 | Agentic Document Reasoning for Evidence-Grounded Clinical Report Generation | Hira Masood, Momina Moetesum, Muhammad Imran Malik, Faisal Shafait, Hassan Aqeel Khan
158 | Vision Language Models as OCR Correctors for Historical Texts | Radoslav Koynov, Triet Ho Anh Doan, Philipp Wieder
313 | Error Patterns in Historical OCR: A Comparative Analysis of TrOCR and a Vision–Language Model | Ari Vesalainen, Eetu Mäkelä, Laura Ruotsalainen, Mikko Tolonen
341 | Structural Analysis of Character Identity at OCR Decision Boundaries in Visually Similar Pairs | Daichi Haraguchi

Competitions #1
Location: EI7
ID | Competition
C3 | ICDAR 2026 FalsID Competition on Falsification and Imitation Detection
C4 | ICDAR 2026 Competition on Long-Term Handwriting Author Identification
C5 | ICDAR 2026 Competition on Multilingual Medieval Handwriting Recognition
C8 | ICDAR 2026 Competition on Writer Identification and Pen Classification from Hand-Drawn Circles

Competitions #2
Location: EI9
ID | Competition
C1 | ICDAR 2026 Competition on Information Extraction from Atomic Layer Deposition/Etching (ALD/E) Scientific Figures
C2 | ICDAR 2026 Competition on Multimodal Reasoning over Documents in Multiple Domains
C6 | ICDAR 2026 HIPE-OCRepair Competition on LLM-Assisted OCR Post-Correction for Historical Documents
C7 | ICDAR 2026 Competition in Text Recognition on Greek Squeezes