Dat Quoc Nguyen, Dat Nguyen

Senior Research Scientist
Head of Natural Language Processing department
VinAI Research, Vietnam
Email: v.datnq9 (at) vinai.io
[Twitter] [GitHub] [Google Scholar]


Resources

For Vietnamese:

Publications

Last updated: 03/06/2023. See my Google Scholar profile for an up-to-date list of publications.
    [2023]
  1. Hung Bui, Dat Quoc Nguyen, Linh Pham and Dinh Phung. 2023. Building and Nurturing AI Development in Vietnam. Communications of the ACM, to appear.
  2. Linh The Nguyen, Thinh Pham and Dat Quoc Nguyen. 2023. XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech. In Proceedings of the 24th Annual Conference of the International Speech Communication Association (INTERSPEECH), to appear. [BibTeX] [Software]
  3. Vinh Tong, Dai Quoc Nguyen, Dinh Phung and Dat Quoc Nguyen. 2023. Two-view Graph Neural Networks for Knowledge Graph Completion. In Proceedings of the 20th Extended Semantic Web Conference (ESWC), pages 262-278. [.pdf] [BibTeX] [Software]
  4. Thien Hai Nguyen, Thinh Pham, Khoi Minh Le, Manh Luong, Nguyen Luong Tran, Hieu Man, Dang Minh Nguyen, Tuan Anh Luu, Thien Huu Nguyen, Hung Bui, Dinh Phung and Dat Quoc Nguyen. 2023. A Vietnamese Spelling Correction System. In Companion Proceedings of the 28th International Conference on Intelligent User Interfaces (IUI), pages 158-161. [BibTeX] [Demo system]
    [2022]
  6. Vinh Tong, Dat Quoc Nguyen, Trung Thanh Huynh, Tam Thanh Nguyen, Quoc Viet Hung Nguyen and Mathias Niepert. 2022. Joint Multilingual Knowledge Graph Completion and Alignment. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 4675-4687. [BibTeX] [Software]
  7. Linh The Nguyen and Dat Quoc Nguyen. 2022. Investigating the Impact of ASR Errors on Spoken Implicit Discourse Relation Recognition. In Proceedings of the First Workshop on Transcript Understanding, pages 34-39. [BibTeX]
  8. Mai Hoang Dao, Thinh Hung Truong and Dat Quoc Nguyen. 2022. Disfluency Detection for Vietnamese. In Proceedings of the 8th Workshop on Noisy User-generated Text (WNUT), pages 194-200. [BibTeX] [Data]
  9. Mai Hoang Dao, Thinh Hung Truong and Dat Quoc Nguyen. 2022. From Disfluency Detection to Intent Detection and Slot Filling. In Proceedings of the 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH), pages 1106-1110. [BibTeX] [Data]
  10. Linh The Nguyen*, Nguyen Luong Tran*, Long Doan*, Manh Luong and Dat Quoc Nguyen. 2022. A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation. In Proceedings of the 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH), pages 1726-1730. [BibTeX] [Data]
  11. Nguyen Luong Tran, Duong Minh Le and Dat Quoc Nguyen. 2022. BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese. In Proceedings of the 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH), pages 1751-1755. [BibTeX] [Software]
  12. Thien Hai Nguyen, Tuan-Duy H. Nguyen, Duy Phung, Duy Tran-Cong Nguyen, Hieu Minh Tran, Manh Luong, Tin Duy Vo, Hung Hai Bui, Dinh Phung and Dat Quoc Nguyen. 2022. A Vietnamese-English Neural Machine Translation System. In Proceedings of the 23rd Annual Conference of the International Speech Communication Association: Show and Tell (INTERSPEECH), pages 5543-5544. [BibTeX] [Software] [Demo system]
  13. Tin Duy Vo, Manh Luong, Duong Minh Le, Hieu Minh Tran, Nhan Tri Do, Tuan-Duy H. Nguyen, Thien Hai Nguyen, Hung Hai Bui, Dat Quoc Nguyen and Dinh Phung. 2022. Vietnamese Speech-based Question Answering over Car Manuals. In Companion Proceedings of the 27th International Conference on Intelligent User Interfaces (IUI), pages 117-119. [BibTeX] [Demo video]
  14. Dai Quoc Nguyen*, Vinh Tong*, Dinh Phung and Dat Quoc Nguyen. 2022. Node Co-occurrence based Graph Neural Networks for Knowledge Graph Link Prediction. In Proceedings of the 15th ACM International Conference on Web Search and Data Mining (WSDM), pages 1589-1592. [.pdf] [BibTeX] [Software]
    [2021]
  16. Zenan Zhai, Christian Druckenbrodt, Camilo Thorne, Saber A. Akhondi, Dat Quoc Nguyen, Trevor Cohn and Karin Verspoor. 2021. ChemTables: A dataset for semantic classification on tables in chemical patents. Journal of Cheminformatics, 13:97:1-20. (SCIE, JCR IF: 5.514) [BibTeX] [Data]
  17. Long Doan*, Linh The Nguyen*, Nguyen Luong Tran*, Thai Hoang and Dat Quoc Nguyen. 2021. PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4495-4503. [BibTeX] [Data]
  18. Mai Hoang Dao*, Thinh Hung Truong* and Dat Quoc Nguyen. 2021. Intent Detection and Slot Filling for Vietnamese. In Proceedings of the 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH), pages 4698-4702. [BibTeX] [Data] [Software]
  19. Thinh Hung Truong, Mai Hoang Dao and Dat Quoc Nguyen. 2021. COVID-19 Named Entity Recognition for Vietnamese. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2146-2153. [BibTeX] [Data]
  20. Linh The Nguyen and Dat Quoc Nguyen. 2021. PhoNLP: A joint multi-task learning model for Vietnamese part-of-speech tagging, named entity recognition and dependency parsing. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations (NAACL), pages 1-7. [BibTeX] [Software]
  21. Jiayuan He, Dat Quoc Nguyen, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Ralph Hoessel, Zubair Afzal, Zenan Zhai, Biaoyan Fang, Hiyori Yoshikawa, Ameer Albahem, Lawrence Cavedon, Trevor Cohn, Timothy Baldwin and Karin Verspoor. 2021. ChEMU 2020: Natural Language Processing Methods are Effective for Information Extraction from Chemical Patents. Frontiers in Research Metrics and Analytics, 6:654438:1-28. [BibTeX]
    [2020]
  23. Dat Quoc Nguyen. 2020. A survey of embedding models of entities and relationships for knowledge graph completion. In Proceedings of the 14th Workshop on Graph-based Methods for Natural Language Processing (TextGraphs), pages 1-14. [BibTeX]
  24. Dat Quoc Nguyen, Thanh Vu and Anh Tuan Nguyen. 2020. BERTweet: A pre-trained language model for English Tweets. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP), pages 9-14. [BibTeX] [Software]
  25. Anh Tuan Nguyen, Mai Hoang Dao and Dat Quoc Nguyen. 2020. A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 4079-4085. [BibTeX] [Data]
  26. Dat Quoc Nguyen and Anh Tuan Nguyen. 2020. PhoBERT: Pre-trained language models for Vietnamese. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1037-1042. [BibTeX] [Software]
  27. Dat Quoc Nguyen*, Thanh Vu*, Afshin Rahimi, Mai Hoang Dao, Linh The Nguyen and Long Doan. 2020. WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets. In Proceedings of the 6th Workshop on Noisy User-generated Text (WNUT), pages 314-318. [BibTeX] [Data]
  28. Dai Quoc Nguyen, Tu Dinh Nguyen, Dat Quoc Nguyen and Dinh Phung. 2020. A Capsule Network-based Model for Learning Node Embeddings. In Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM), pages 3313-3316. [.pdf] [BibTeX] [Software]
  29. Thanh Vu, Dat Quoc Nguyen and Anthony Nguyen. 2020. A Label Attention Model for ICD Coding from Clinical Text. In Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI), pages 3335-3341. [BibTeX] [Software]
  30. Jiayuan He, Dat Quoc Nguyen, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Ralph Hoessel, Zubair Afzal, Zenan Zhai, Biaoyan Fang, Hiyori Yoshikawa, Ameer Albahem, Lawrence Cavedon, Trevor Cohn, Timothy Baldwin and Karin Verspoor. 2020. Overview of ChEMU 2020: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents. In Proceedings of the Eleventh International Conference of the CLEF Association (CLEF), pages 237-254. [.pdf] [BibTeX]
  31. Jiayuan He, Dat Quoc Nguyen and others. 2020. An Extended Overview of the CLEF 2020 ChEMU Lab: Information Extraction of Chemical Reactions from Patents. In Proceedings of the Working Notes of CLEF 2020—Conference and Labs of the Evaluation Forum. [BibTeX]
  32. Mai Hoang Dao and Dat Quoc Nguyen. 2020. VinAI at ChEMU 2020: An accurate system for named entity recognition in chemical reactions from patents. In Proceedings of the Working Notes of CLEF 2020—Conference and Labs of the Evaluation Forum. [BibTeX]
  33. Dat Quoc Nguyen, Zenan Zhai, Hiyori Yoshikawa, Biaoyan Fang, Christian Druckenbrodt, Camilo Thorne, Ralph Hoessel, Saber A. Akhondi, Trevor Cohn, Timothy Baldwin and Karin Verspoor. 2020. ChEMU: Named Entity Recognition and Event Extraction of Chemical Reactions from Patents. In Proceedings of the 42nd European Conference on Information Retrieval (ECIR), pages 572-579. [.pdf] [BibTeX] [Data]
    [2019]
  35. Hiyori Yoshikawa, Dat Quoc Nguyen, Zenan Zhai, Christian Druckenbrodt, Camilo Thorne, Saber A. Akhondi, Timothy Baldwin and Karin Verspoor. 2019. Detecting Chemical Reactions in Patents. In Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 100-110 (Best Paper Award). [BibTeX]
  36. Dat Quoc Nguyen. 2019. A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing. In Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 28-34. [BibTeX]
  37. Zenan Zhai, Dat Quoc Nguyen, Saber A. Akhondi, Camilo Thorne, Christian Druckenbrodt, Trevor Cohn, Michelle Gregory and Karin Verspoor. 2019. Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings. In Proceedings of the 18th ACL Workshop on Biomedical Natural Language Processing (BioNLP), pages 328-338. [BibTeX] [Software]
  38. Dai Quoc Nguyen, Thanh Vu, Tu Dinh Nguyen, Dat Quoc Nguyen and Dinh Phung. 2019. A Capsule Network-based Embedding Model for Knowledge Graph Completion and Search Personalization. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 2180-2189. [BibTeX] [Software]
  39. Dat Quoc Nguyen and Karin Verspoor. 2019. End-to-end neural relation extraction using deep biaffine attention. In Proceedings of the 41st European Conference on Information Retrieval (ECIR), pages 729-738. [.pdf] [BibTeX] [Software]
  40. Dat Quoc Nguyen and Karin Verspoor. 2019. From POS tagging to dependency parsing for biomedical event extraction. BMC Bioinformatics, 20:72:1-13. (SCIE, JCR IF: 2.511) [BibTeX] [Software]
  41. Dai Quoc Nguyen, Dat Quoc Nguyen, Tu Dinh Nguyen and Dinh Phung. 2019. A convolutional neural network-based model for knowledge base completion and its application to search personalization. Semantic Web, 10(5):947-960. (SCIE, JCR IF: 3.524) [.pdf] [BibTeX]
    [2018]
  43. Zenan Zhai, Dat Quoc Nguyen and Karin Verspoor. 2018. Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition. In Proceedings of the 9th International Workshop on Health Text Mining and Information Analysis (LOUHI), pages 38-43. [BibTeX]
  44. Dat Quoc Nguyen and Karin Verspoor. 2018. An improved neural network model for joint POS tagging and dependency parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (CoNLL), pages 81-91. [BibTeX] [Software]
  45. Dat Quoc Nguyen and Karin Verspoor. 2018. Convolutional neural networks for chemical-disease relation extraction are improved with character-based word embeddings. In Proceedings of the 17th ACL Workshop on Biomedical Natural Language Processing (BioNLP), pages 129-136. [BibTeX]
  46. Thanh Vu, Dat Quoc Nguyen, Xuan-Son Vu, Dai Quoc Nguyen, Michael Catt and Michael Trenell. 2018. NIHRIO at SemEval-2018 Task 3: A Simple and Accurate Neural Network Model for Irony Detection in Twitter. In Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval), pages 525-530. [BibTeX] [Software]
  47. Dai Quoc Nguyen, Tu Dinh Nguyen, Dat Quoc Nguyen and Dinh Phung. 2018. A Novel Embedding Model for Knowledge Base Completion Based on Convolutional Neural Network. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 327-333. [BibTeX] [Software]
  48. Thanh Vu, Dat Quoc Nguyen, Dai Quoc Nguyen, Mark Dras and Mark Johnson. 2018. VnCoreNLP: A Vietnamese Natural Language Processing Toolkit. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations (NAACL), pages 56-60. [BibTeX] [Software]
  49. Dat Quoc Nguyen, Dai Quoc Nguyen, Thanh Vu, Mark Dras and Mark Johnson. 2018. A Fast and Accurate Vietnamese Word Segmenter. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 2582-2587. [BibTeX] [Software]
    [2017]
  51. Dat Quoc Nguyen, Thanh Vu, Dai Quoc Nguyen, Mark Dras and Mark Johnson. 2017. From Word Segmentation to POS Tagging for Vietnamese. In Proceedings of the 15th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 108-113. [BibTeX] [Software]
  52. Dai Quoc Nguyen, Dat Quoc Nguyen, Cuong Xuan Chu, Stefan Thater and Manfred Pinkal. 2017. Sequence to Sequence Learning for Event Prediction. In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP), pages 37-42. [BibTeX] [Data]
  53. Dat Quoc Nguyen. 2017. Modeling Topics and Knowledge Bases with Vector Representations. PhD thesis, Macquarie University, Australia. [.pdf] [BibTeX]
  54. Dat Quoc Nguyen, Mark Dras and Mark Johnson. 2017. A Novel Neural Network Model for Joint POS Tagging and Graph-based Dependency Parsing. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (CoNLL), pages 134-142. [BibTeX] [Software]
  55. Dai Quoc Nguyen, Dat Quoc Nguyen, Ashutosh Modi, Stefan Thater and Manfred Pinkal. 2017. A Mixture Model for Learning Multi-Sense Word Embeddings. In Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM), pages 121-127. [BibTeX]
  56. Thanh Vu*, Dat Quoc Nguyen*, Mark Johnson, Dawei Song and Alistair Willis. 2017. Search Personalization with Embeddings. In Proceedings of the 39th European Conference on Information Retrieval (ECIR), pages 598-604. [.pdf] [BibTeX]
  57. Dat Quoc Nguyen*, Dai Quoc Nguyen* and Son Bao Pham. 2017. Ripple Down Rules for Question Answering. Semantic Web, 8(4):511-532. (SCIE, JCR IF: 2.224) [.pdf] [BibTeX]
    [2016]
  59. Dat Quoc Nguyen, Mark Dras and Mark Johnson. 2016. An empirical study for Vietnamese dependency parsing. In Proceedings of the 14th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 143-149. [BibTeX]
  60. Dat Quoc Nguyen, Kairit Sirts, Lizhen Qu and Mark Johnson. 2016. Neighborhood Mixture Model for Knowledge Base Completion. In Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL), pages 40-50. [BibTeX] [Software]
  61. Dat Quoc Nguyen, Kairit Sirts, Lizhen Qu and Mark Johnson. 2016. STransE: a novel embedding model of entities and relationships in knowledge bases. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 460-466. [BibTeX] [Software]
  62. Didi Surian, Dat Quoc Nguyen, Georgina Kennedy, Mark Johnson, Enrico Coiera and Adam G. Dunn. 2016. Characterizing Twitter Discussions About HPV Vaccines Using Topic Modeling and Community Detection. Journal of Medical Internet Research, 18(8):e232. (SCIE, JCR IF: 5.175) [BibTeX]
  63. Dat Quoc Nguyen*, Dai Quoc Nguyen*, Dang Duc Pham and Son Bao Pham. 2016. A Robust Transformation-Based Learning Approach Using Ripple Down Rules for Part-Of-Speech Tagging. AI Communications, 29(3):409-422. (SCIE, JCR IF: 0.654) [.pdf] [BibTeX]
    [2015]
  65. Dat Quoc Nguyen, Kairit Sirts and Mark Johnson. 2015. Improving Topic Coherence with Latent Feature Word Representations in MAP Estimation for Topic Modeling. In Proceedings of the 13th Annual Workshop of the Australasian Language Technology Association (ALTA), pages 116-121. [BibTeX] [Software]
  66. Dat Quoc Nguyen, Richard Billingsley, Lan Du and Mark Johnson. 2015. Improving Topic Models with Latent Feature Word Representations. Transactions of the Association for Computational Linguistics (TACL), 3:299-313. [BibTeX] [Software] [Data]
    [2014]
  68. Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao Pham, Phuong-Thai Nguyen and Minh Le Nguyen. 2014. From Treebank Conversion to Automatic Dependency Parsing for Vietnamese. In Proceedings of 19th International Conference on Application of Natural Language to Information Systems (NLDB), pages 196-207. [.pdf] [BibTeX] [Data]
  69. Dai Quoc Nguyen, Dat Quoc Nguyen, Thanh Vu and Son Bao Pham. 2014. Sentiment Classification on Polarity Reviews: An Empirical Study Using Rating-based Features. In Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), pages 128-135. [BibTeX] [Data]
  70. Dat Quoc Nguyen, Dai Quoc Nguyen, Dang Duc Pham and Son Bao Pham. 2014. RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 17-20. [BibTeX] [Software]
    [2013]
  72. Dai Quoc Nguyen, Dat Quoc Nguyen and Son Bao Pham. 2013. A Two-Stage Classifier for Sentiment Analysis. In Proceedings of the 6th International Joint Conference on Natural Language Processing (IJCNLP), pages 897-901. [BibTeX]
  73. Giang Binh Tran, Mohammad Alrifai and Dat Quoc Nguyen. 2013. Predicting Relevant News Events for Timeline Summaries. In Companion Proceedings of the 22nd International Conference on World Wide Web (WWW), pages 91-92. [.pdf] [BibTeX] [Data]
  74. Dat Quoc Nguyen, Dai Quoc Nguyen and Son Bao Pham. 2013. KbQAS: A Knowledge-based QA System. In Proceedings of the 12th International Semantic Web Conference: Posters and Demonstrations Track (ISWC), pages 109-112. [BibTeX] [Demo video]
    [Before 2013]
  76. Dai Quoc Nguyen, Dat Quoc Nguyen and Son Bao Pham. 2012. A Vietnamese Text-based Conversational Agent. In Proceedings of the 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE), pages 699-708. [.pdf] [BibTeX]
  77. Dai Quoc Nguyen, Dat Quoc Nguyen and Son Bao Pham. 2012. A Semantic Approach for Question Analysis. In Proceedings of the 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE), pages 156-165. [.pdf] [BibTeX]
  78. Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao Pham and Dang Duc Pham. 2011. Ripple Down Rules for Part-Of-Speech Tagging. In Proceedings of the 12th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), pages 190-201. [.pdf] [BibTeX]
  79. Thanh Vu and Dat Quoc Nguyen. 2011. A Vietnamese Information Retrieval System for Product-Price. In Proceedings of the 2011 IEEE International Conference on Granular Computing (GrC), pages 691-696. [.pdf] [BibTeX]
  80. Dat Quoc Nguyen*, Dai Quoc Nguyen* and Son Bao Pham. 2011. Systematic Knowledge Acquisition for Question Analysis. In Proceedings of the 8th International Conference on Recent Advances in Natural Language Processing (RANLP), pages 406-412. [BibTeX]
  81. Dai Quoc Nguyen, Dat Quoc Nguyen, Khoi Trong Ma and Son Bao Pham. 2011. Automatic Ontology Construction from Vietnamese text. In Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE), pages 485-488. [.pdf] [BibTeX]
  82. Dai Quoc Nguyen, Dat Quoc Nguyen and Son Bao Pham. 2009. A Vietnamese Question Answering System. In Proceedings of the 2009 International Conference on Knowledge and Systems Engineering (KSE), pages 26-32. [.pdf] [BibTeX]
  83. Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao Pham and The Duy Bui. 2009. A Fast Template-based Approach to Automatically Identify Primary Text Content of a Web Page. In Proceedings of the 2009 International Conference on Knowledge and Systems Engineering (KSE), pages 232-236. [.pdf] [BibTeX]

Talks & Panels

Academic service