REPRESENTING ASPECTUAL MEANING IN SENTENCE: COMPUTATIONAL MODELING BASED ON CHINESE

Representing Aspectual Meaning in Sentence: Computational Modeling Based on Chinese

Representing Aspectual Meaning in Sentence: Computational Modeling Based on Chinese

Blog Article

Situation types can be viewed as the foundation of representation of sentence meaning.Noting that situation types cannot be determined by verbs alone, recent studies often focus on situation type prediction in terms of the combination of different linguistic constituents at the sentence level instead of lexically marked situation types.However, in languages with a fully marked aspectual system, such as Mandarin Chinese, such an approach may miss the opportunity of leveraging lexical aspects as well as other distribution-based lexical cues of event types.

Currently, there is a lack of resources and methods for the identification and validation of the lexical aspect, and this issue is particularly severe for Chinese.From a computational linguistics perspective, the main reason for this shortage stems from the absence bostik universal primer pro of a verified lexical aspect classification system, and consequently, a gold-standard dataset annotated according to this classification system.Additionally, owing to the lack of such a high-quality dataset, it remains unclear whether semantic models, including large general-purpose language models, can actually capture this important yet complex semantic information.

As a result, the true realization of lexical aspect analysis cannot be achieved.To address these two problems, this paper sets out two objectives.First, we aim to construct a high-quality lexical aspect dataset.

Since the classification of the lexical aspect depends baseball scoreboards for sale on how it interacts with aspectual markers, we establish a scientific classification and data construction process through the selection of vocabulary items, the compilation of co-occurrence frequency matrices, and hierarchical clustering.Second, based on the constructed dataset, we separately evaluate the ability of linguistic features and large language model word embeddings to identify lexical aspect categories in order to (1) verify the capacity of semantic models to infer complex semantics and (2) achieve high-accuracy prediction of lexical aspects.Our final classification accuracy is 72.

05%, representing the best result reported thus far.

Report this page