From POS tagging to dependency parsing for biomedical event extraction

AI-generated keywords: Biomedical event extraction POS tagging dependency parsing syntactic processing neural models

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Dat Quoc Nguyen and Karin Verspoor focus on relation and event extraction from biomedical research publications.
  • They highlight the significance of syntactic information in this process and aim to determine effective approaches to syntactic processing in the biomedical domain.
  • The researchers compare traditional feature-based models with neural network-based models for POS tagging and dependency parsing tasks using GENIA and CRAFT corpora.
  • Their analysis shows that neural models generally outperform feature-based models on these benchmark datasets.
  • The study includes a task-oriented evaluation to assess how parsing models impact downstream applications like biomedical event extraction.
  • Results indicate that superior intrinsic parsing performance doesn't always lead to better extrinsic event extraction performance, showing the complexity of integrating syntactic processing into practical applications.
  • Nguyen and Verspoor provide a detailed exploration of traditional and neural network-based models for POS tagging and dependency parsing in biomedicine, emphasizing the importance of parser selection for optimizing downstream task performance.
  • Retrained models from their study are publicly available for further research at https://github.com/datquocnguyen/BioPosDep.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dat Quoc Nguyen, Karin Verspoor

Accepted for publication in BMC Bioinformatics

Abstract: Background: Given the importance of relation or event extraction from biomedical research publications to support knowledge capture and synthesis, and the strong dependency of approaches to this information extraction task on syntactic information, it is valuable to understand which approaches to syntactic processing of biomedical text have the highest performance. Results: We perform an empirical study comparing state-of-the-art traditional feature-based and neural network-based models for two core natural language processing tasks of part-of-speech (POS) tagging and dependency parsing on two benchmark biomedical corpora, GENIA and CRAFT. To the best of our knowledge, there is no recent work making such comparisons in the biomedical context; specifically no detailed analysis of neural models on this data is available. Experimental results show that in general, the neural models outperform the feature-based models on two benchmark biomedical corpora GENIA and CRAFT. We also perform a task-oriented evaluation to investigate the influences of these models in a downstream application on biomedical event extraction, and show that better intrinsic parsing performance does not always imply better extrinsic event extraction performance. Conclusion: We have presented a detailed empirical study comparing traditional feature-based and neural network-based models for POS tagging and dependency parsing in the biomedical context, and also investigated the influence of parser selection for a biomedical event extraction downstream task. Availability of data and material: We make the retrained models available at https://github.com/datquocnguyen/BioPosDep

Submitted to arXiv on 11 Aug. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1808.03731v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their study titled "From POS tagging to dependency parsing for biomedical event extraction," authors Dat Quoc Nguyen and Karin Verspoor address the crucial task of relation and event extraction from biomedical research publications. They emphasize the significance of syntactic information in this process and aim to determine the most effective approaches to syntactic processing in the biomedical domain. The researchers conduct an empirical investigation comparing traditional feature-based models with neural network-based models for part-of-speech (POS) tagging and dependency parsing tasks using two widely recognized biomedical corpora, GENIA and CRAFT. Their analysis reveals that, overall, neural models outperform feature-based models on these benchmark datasets. This finding is particularly noteworthy as there has been a lack of recent comparative studies focusing on neural models in the context of biomedical text analysis. Furthermore, the study includes a task-oriented evaluation to assess how these parsing models impact downstream applications such as biomedical event extraction. Surprisingly, the results indicate that superior intrinsic parsing performance does not always translate to better extrinsic event extraction performance, highlighting the complexity of integrating syntactic processing into practical applications. In conclusion, Nguyen and Verspoor present a detailed empirical exploration of traditional and neural network-based models for POS tagging and dependency parsing in biomedicine. Their work sheds light on the importance of parser selection in optimizing performance for downstream tasks like event extraction. The retrained models from their study are made publicly available for further research at https://github.com/datquocnguyen/BioPosDep. This research, accepted for publication in BMC Bioinformatics, contributes valuable insights into enhancing information extraction processes from biomedical literature through advanced syntactic processing techniques.
Created on 27 Apr. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.