Improving Text-to-SQL Evaluation Methodology

Author: cecz

August 2024

Despite achieving good performance on some public benchmarks, existing text-to-SQL models typically rely on the lexical matching between words in natural language (NL) questions and tokens in table schemas, which may render the models vulnerable to attacks that break the schema linking mechanism.

Improving Text-to-SQL Evaluation Methodology. Catherine Finegan-Dollak, Jonathan K. Kummerfeld, Li Zhang, Karthik Ramanathan, Sesh Sadasivam, Rui Zhang, …

Rui Zhang - GitHub Pages

First, we compare human-generated and automatically generated questions, characterizing properties of queries necessary for real-world applications. To facilitate …

Improving Text-to-SQL Evaluation Methodology. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume …

NL2SQL: Natural Language to SQL Query Translator

In the process, we (1) introduce a new, challenging dataset, (2) standardize and fix many errors in existing datasets, and (3) propose a simple yet effective baseline …

… question-based split fails to evaluate a system’s generalizability. In addition, by analyzing properties of human-generated and automatically generated text-to-SQL datasets, we show the need to evaluate on more than one dataset to ensure systems perform well on realistic data. And we release improved resources to facilitate such …

Semantic co-reference and ellipsis always lead to information deficiency when parsing natural language utterances with SQL in a multi-turn dialogue (i.e., conversational text-to-SQL task). The methodology of dividing a dialogue understanding task into dialogue utterance rewriting and language understanding is feasible to tackle this problem. To …
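
The snippet above notes that a question-based split fails to evaluate generalizability. A minimal sketch of why, assuming the alternative is a split keyed on the SQL query so that no query appears in both train and test: the toy examples, the function names, and the `test_fraction` parameter below are all hypothetical and are not taken from the paper's released code.

```python
import random
from collections import defaultdict

# Hypothetical toy data: (question, SQL query) pairs.
# Several differently worded questions can map to the same SQL query.
EXAMPLES = [
    ("list all flights", "SELECT * FROM flight"),
    ("show me every flight", "SELECT * FROM flight"),
    ("how many airports are there", "SELECT COUNT(*) FROM airport"),
    ("count the airports", "SELECT COUNT(*) FROM airport"),
    ("which airlines exist", "SELECT name FROM airline"),
    ("give me the airline names", "SELECT name FROM airline"),
]


def question_split(examples, test_fraction=0.25, seed=0):
    """Shuffle individual examples; the same SQL may land on both sides."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def query_split(examples, test_fraction=0.25, seed=0):
    """Group examples by SQL query, then assign whole groups to one side."""
    groups = defaultdict(list)
    for question, sql in examples:
        groups[sql].append((question, sql))
    keys = sorted(groups)
    rng = random.Random(seed)
    rng.shuffle(keys)
    n_test = max(1, int(len(keys) * test_fraction))
    test_keys = set(keys[:n_test])
    train = [ex for k in keys if k not in test_keys for ex in groups[k]]
    test = [ex for k in keys if k in test_keys for ex in groups[k]]
    return train, test


def shared_queries(train, test):
    """Return the SQL queries that occur in both train and test."""
    return {sql for _, sql in train} & {sql for _, sql in test}
```

Calling `shared_queries(*query_split(EXAMPLES))` returns an empty set by construction, whereas the same check on `question_split(EXAMPLES)` can return shared queries, which would let a system score well by recalling queries it has already seen.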

Distributional Generalization in Natural Language Processing.

Improving Text-to-SQL Evaluation Methodology - ACL 2018

This paper presents a practical usability investigation of recurrent neural networks (RNNs) to determine the best-suited machine learning method for estimating electric vehicle (EV) batteries’ state of charge. Using models from multiple published sources and cross-validation testing with several driving scenarios to determine the state of charge …

We define a new complex and cross-domain semantic parsing and text-to-SQL task where different complex SQL queries and databases appear in train and test sets. In this way, the task requires the model to generalize well to both new SQL queries and new database schemas.

"Paper: Improving Text-to-SQL Evaluation Methodology, Finegan-Dollak C, Kummerfeld J K, Zhang L, et al., ACL 2018. WikiTableQuestions Home …" - Improving text-to-sql evaluation methodology

DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL …

Improving Text-to-SQL with a Hybrid Decoding Method. Entropy (Basel). doi: 10.3390/e25030513. Authors: Geunyeong Jeong, Mirae Han, Seulgi Kim, …

… current evaluations of text-to-SQL systems. First, we compare human-generated and automatically generated questions, characterizing properties of queries necessary for …

Improving Text-to-SQL Evaluation Methodology. Catherine Finegan-Dollak, Jonathan K. Kummerfeld, Li Zhang, Karthik Ramanathan, Sesh Sadasivam, Rui Zhang, Dragomir R. Radev.

Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention. Pengcheng Yin, Hao Fang, Graham Neubig, …

First, we compare human-generated and automatically generated questions, characterizing properties of queries necessary for real-world applications. To facilitate evaluation on multiple datasets, we release standardized and improved versions of seven existing datasets and one new text-to-SQL dataset.

The experiment results of evaluating the performance of the two-stage frameworks using different rewrite models show that the efficiency of rewrite models is important and still needs improvement. …

Improving Text-to-SQL Evaluation Methodology. Preprint. Jun 2018; Catherine Finegan-Dollak; ... We identify limitations of and propose improvements to current evaluations of text-to-SQL systems ...

Improving Text-to-SQL Evaluation Methodology. To be informative, an evaluation must measure how well systems generalize to realistic unseen data. We identify limitations …

Most prior work on text-to-SQL tasks focuses on cross-domain generalization, which mainly assesses how the models generalize the domain …

Improving text-to-sql evaluation methodology. arXiv preprint arXiv:1806.09029 (2018). Matt Gardner, Yoav Artzi, Victoria Basmov, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, et al. 2020. Evaluating Models' Local Decision Boundaries via …

This paper presents a simple yet effective data augmentation framework. First, given a database, we automatically produce a large number of SQL queries based on an abstract syntax tree grammar. We require the …

Text-to-SQL parsing aims to automatically transform natural language (NL) questions into SQL queries based on the given databases (DBs) (Tang and Mooney, 2001), as depicted at the top of Figure...

Text-to-SQL is a task that converts natural language questions into SQL queries. Recent text-to-SQL models employ two decoding methods: sketch-based …

This repository contains data and code for building and evaluating systems that map sentences to SQL, developed as part of: Improving Text-to-SQL Evaluation …

We introduce Spider-DK, a human-curated dataset based on the Spider benchmark for evaluating the generalization of text-to-SQL models, with a focus on understanding the domain knowledge. We demonstrate that the performance of existing text-to-SQL models drops dramatically on Spider-DK, even if the domain knowledge …
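
One of the snippets above describes producing SQL queries from an abstract syntax tree grammar for data augmentation. The following is only a rough sketch of that general idea, not the cited framework's grammar: the schema, the three production rules, the `value` placeholder, and every function name are assumptions made up for illustration.

```python
import random

# Hypothetical toy schema; table and column names are invented.
SCHEMA = {
    "singer": ["name", "age", "country"],
    "concert": ["title", "year"],
}
AGGREGATES = ["COUNT", "MAX", "MIN"]
OPERATORS = ["=", ">", "<"]


def sample_ast(rng):
    """Sample a small query tree from an assumed three-rule grammar.

    query  := SELECT select FROM table [WHERE cond]
    select := column | AGG(column)
    cond   := column OP value
    """
    table = rng.choice(sorted(SCHEMA))
    columns = SCHEMA[table]
    select = {"column": rng.choice(columns), "agg": None}
    if rng.random() < 0.5:
        select["agg"] = rng.choice(AGGREGATES)
    where = None
    if rng.random() < 0.5:
        where = {"column": rng.choice(columns), "op": rng.choice(OPERATORS)}
    return {"select": select, "from": table, "where": where}


def render(ast):
    """Turn a sampled tree into a SQL string with a 'value' placeholder."""
    sel = ast["select"]
    target = f"{sel['agg']}({sel['column']})" if sel["agg"] else sel["column"]
    sql = f"SELECT {target} FROM {ast['from']}"
    if ast["where"] is not None:
        cond = ast["where"]
        sql += f" WHERE {cond['column']} {cond['op']} value"
    return sql


def generate(n, seed=0):
    """Generate n SQL strings reproducibly from a fixed seed."""
    rng = random.Random(seed)
    return [render(sample_ast(rng)) for _ in range(n)]
```

Sampling a tree first and rendering it afterwards keeps the structure available for later steps, such as filling the `value` placeholder from database contents or pairing each query with a generated question.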