| 83 | UniLipi: A Unified Multi-Script OCR for Historical Indic Manuscripts<br>Tathagata Ghosh, Sai Madhusudan Gunda, Simran Singh Sandral, Ravi Kiran Sarvadevabhatla |
| 120 | Learning Diachronic Representations of Ancient Greek Letterforms<br>John Pavlopoulos, Spyros Barbakos, Lavinia Ferretti, Dionysis Voulgarakis, Asimina Paparrigopoulou, Maria Konstantinidou, Giuseppe De Gregorio, Isabelle Marthot-Santaniello, Paraskevi Platanou, Holger Essler |
| 157 | Automatic Layout Detection in Historical Civil Records Using Deep Object Detection<br>Wissam Alkendi, Franck Gechter, Laurent Heyberger, Christophe Guyeux |
| 261 | Leveraging Morphology for Historical Script Metrological Analysis<br>Malamatenia Vlachou Efstathiou, Raphaël Baena, Dominique Stutzmann, Mathieu Aubry |
| 301 | One Model, Many Guidelines: Instruction Fine-Tuning for Historical Named Entity Recognition<br>Nam Nguyen, Emanuela Boros, Adam Jatowt, Ahmed Hamdi, Mickael Coustaty, Antoine Doucet |
| 10 | Robust Interpretation of Historical Documents in Knowledge Graphs Through Query Inference and Execution<br>Sebastià Nicolau Orell, Adrià Molina Rodríguez, Oriol Ramos Terrades, Josep Lladós Canet |
| 17 | Conversational Retrieval and On-the-Fly Knowledge Modeling from Historical Documents<br>Paula Font Solà, Adrià Molina Rodríguez, Josep Lladós Canet |
| 21 | Revisiting how we access to historical archives: Auditing Gender Stereotypes and the Division of Labour in the Analysis of Historical Photography Collections<br>Francesc Net Barnes, Adrià Molina Rodríguez, Sofia Llacer-Caro, Lluis Gomez Bigorda |
| 55 | HME-Leibniz: A Multi-level Mathematical Expression Dataset from Leibniz's Manuscripts<br>Yejing Xie, Ze Qian, Yunfan Li, David Rabouin, Harold Mouchère |
| 66 | EpiSAM: Character Segmentation in Challenging Stone Inscriptions<br>Arnav Sharma, Pratyush Jena, Amal Joseph, Ravi Kiran Sarvadevabhatla |
| 74 | Parameter-Efficient and Adaptive Fine-Tuning for Long-Tailed Ancient Characters Recognition<br>Hao Wang, Aouaidjia Kamel, Konstantinos Kotropoulos, Chongsheng Zhang |
| 82 | Evaluating Vision-Language Models on Historical Postcards<br>Matthieu Pelingre, Salvatore Tabbone |
| 256 | Text region detection in historical astronomical diagrams<br>Zeynep Sonat Baltaci, Raphael Baena, Fei Meng, Somkeo Norindr, Florence Somer, Matthieu Husson, Mathieu Aubry |
| 312 | Bridging the Gaps: Learning to Estimate Missing Text in Fragmentary Greek Inscriptions<br>Silvia Zottin, Axel De Nardin, Valentina Mignosa, Maddalena Zunino, Gian Luca Foresti |
| 320 | Angkorian-KSI: A Multi-Task Benchmark for Khmer Stone Inscription Analysis<br>Nimol Thuon, Jun Du, Ranysakol Thuon, Panhapin Theang |
| 391 | Vision-Language Model based Transfer Learning for Historical Document Recognition<br>Yifan Huang, Liangrui Peng, Tianqi Zhao, Di Wu, Kemeng Zhao, Shuo Li, Zhiyu Li, Yuyang Li |
| 282 | HIDRA: Hierarchical Ink-aware Dual-granularity Retrieval Architecture for Historical Fragments<br>Jihad Al Akl, Chady Abou Jaoude, Zahi Al Chami, Marianne Abi Kanaan, Abdallah Makhoul |
| 300 | Theatre Chapbooks At Scale: A Statistical Comparative Analysis of Typography<br>Diego Belzarena, Seginus Mowlavi, Paula Casariego Castiñeira, Alejandra Ulla Lorenzo, Gregory Randall, Jean-Michel Morel |
| 298 | Generalized Open-set Single-shot Character Recognition on Ancient Egyptian Hieratic Characters<br>Stephan Unter, Chang Liu, Elisa Barney Smith |
| 208 | Automated Character-Level Annotation for Historical Nom Documents via an Iterative Self-Updating Radical-Aware Recognizer<br>Cuong Nguyen, Khoa Nguyen Tran, Ngoc Tuan Nguyen, Hung Tuan Nguyen, Nam Tuan Ly, Masaki Nakagawa |
| 213 | MaPE-Former: A Mask-Aware Position Encoding Network for Chinese Character Image Restoration<br>Wei Wei, Xinrui Liu, Jianxin Zhang, Xiaodong Duan |
| 153 | From Pixels to Structure: Lightweight Vision-Language Models for Document OCR and Structured JSON Extraction<br>Uddipan Basu Bir, Vincent Christlein, Andreas Maier, Mathias Zinnen |
| 57 | Towards Non-Latin Character and Layout Personalization for Enhanced Readability<br>Rina Buoy, Dylan Berkamp Fouepe Dongmo, Vesal Khean, Simone Marinai, Koichi Kise |
| 314 | Efficient Table QA via TableGrid Navigation and Progressive Inference Prompting<br>Amritansh Maurya, Navjot Singh, Mohammed Javed, Omar Moured |
| 231 | Can VLMs Understand Handwritten Mathematical Documents?<br>Shree Mitra, Ajoy Mondal, C. V. Jawahar |
| 85 | GRACE: Gradient-Regulated Approach for Consistent Explanations<br>Sayantan Basu |
| 311 | GLiDRE: Generalist Lightweight Model for Document-level Relation Extraction<br>Robin Armingaud, Romaric Besançon |
| 260 | From Chunks to Graphs: Training-Free Multimodal Late Interaction for Document Understanding<br>Ayush Lodh, Souparni Mazumder, Sanket Biswas, Josep Llados, Nisha Singh |
| 387 | Figures as Evidence: Multi-Image Scientific Generation<br>Jawad Ibn Ahad, Mritunjoy Chakraborty, Fuad Rahman, Sifat Momen, Shafin Rahman, Nabeel Mohammed |
| 64 | Prediction of Grade, Gender, and Academic Performance of Children and Teenagers from Handwriting Using the Sigma-Lognormal Model<br>Adrian Iste, Kazuki Nishizawa, Chisa Tanaka, Andrew Vargo, Anna Scius-Bertrand, Andreas Fischer, Koichi Kise |
| 216 | Hierarchical Stroke-Level Clustering and Step-Level Segmentation for Automatic Scoring of Geometric Construction Answers with an Electronic Drawing Compass<br>Thanh-Nghia Truong, Hung Tuan Nguyen, Nam Tuan Ly, Yoichi Tsuchida, Hiroshi Miyazawa, Tomo Asakura, Masamitsu Ito, Toshihiko Horie, Fumiko Yasuno, Masaki Nakagawa |
| 227 | Towards Khmer Scene Document Layout Detection<br>Marry Kong, Rina Buoy, Sovisal Chenda, Nguonly Taing, Masakazu Iwamura, Koichi Kise |
| 258 | Bounding Box Label Propagation for Re-Annotation of Document Layout Analysis Datasets<br>Nick Jochum, Tobias Alt-Veit, Christian Schön, Alexander Lück, René Schuster, Didier Stricker |
| 73 | Counterfeit Answers: Adversarial Forgery against OCR-Free Document Visual Question Answering<br>Marco Pintore, Maura Pintor, Battista Biggio, Dimosthenis Karatzas |
| 173 | Active Learning for Cascaded Object Detection: Balancing Coverage and Uncertainty in Table Extraction Pipelines<br>Eliott Thomas, Mickael Coustaty, Aurélie Joseph, Gaspar Deloin, Vincent Poulain D'andecy, Jean-Marc Ogier |
| 242 | FastTab: A Fast Table Recognizer with a Tiny Recursive Module and 1D Transformers<br>Laziz Hamdi, Amine Tamasna, Pascal Boisson, Thierry Paquet |
| 192 | ConRTF: Edge-Constrained Boundary Distribution Refinement for Realtime TransFormer Table Structure Recognition<br>Eliott Thomas, Tri-Cong Pham, Mickael Coustaty, Aurélie Joseph, Gaspar Deloin, Vincent Poulain D'andecy, Jean-Marc Ogier, Antoine Doucet |
| 156 | SCALES: Scalable Context-Aware Learning with Expert Specialization for Incremental Multilingual Text Recognition<br>Qing Lin, Xiaohui Li, Heng Zhang, Fei Yin, Chenglin Liu |
| 151 | Multi-Modal Deep Learning for Medieval Inscription Recognition: A Study of Saint Sophia Cathedral Graffiti<br>Aram Karimi, Jonathan Westine, Gunnar Almevik |
| 139 | AOSSig4000: A Real-World Chinese Handwritten Signature Dataset with Diverse Background Noise and Pixel-Level Annotations<br>Xunhui Qin, Desheng Wang, Kunpeng Gui, Fang Shi, Zhonghao Shen, Du Zhou, Ke Liu, Peirong Zhang, Yang Xue, Lianwen Jin |
| 91 | Online Signature Verification Using Augmented Path Signature and T-Mamba<br>Ruiling Li, Danyu Yang |
| 23 | Master Forgers, Fragile Detectors? A Forensic Study of Vision-Language Models for Signature Verification<br>Kumari Priya, Bibek Das, Chandranath Adak, Soumi Chattopadhyay |
| 122 | Preserving High-Fidelity Character Structure in Handwritten Text Generation via Multimodal Guidance<br>Heng Wang, Yiming Wang, Hongxi Wei |
| 41 | LLMSFL: LLM-Driven Smart Feedback Loop System for Target Document Generation<br>Zezhong Guo, Yongjian Zhang |
| 167 | Arbitrary Glyph and Multi-Resolution Font Generation with Mixed Content Representations<br>Xiaoge Chen, Shilin Li, Leilei Yao, Anna Zhu |
| 356 | MIDAS: Multi-LLM Iterative Data-Adaptive Summarization<br>Karen Lee, Dhanashree Balaram, Seojun Shon, Umair Rasheed |
| 384 | Agentic Document Reasoning for Evidence-Grounded Clinical Report Generation<br>Hira Masood, Momina Moetesum, Muhammad Imran Malik, Faisal Shafait, Hassan Aqeel Khan |
| 158 | Vision Language Models as OCR Correctors for Historical Texts<br>Radoslav Koynov, Triet Ho Anh Doan, Philipp Wieder |
| 313 | Error Patterns in Historical OCR: A Comparative Analysis of TrOCR and a Vision–Language Model<br>Ari Vesalainen, Eetu Mäkelä, Laura Ruotsalainen, Mikko Tolonen |
| 341 | Structural Analysis of Character Identity at OCR Decision Boundaries in Visually Similar Pairs<br>Daichi Haraguchi |