Data Representation and Learning for Dialogue System
Summary | This project makes the following key points: 1. With the main goal of "helping relieve the stress problems faced by NYCU students", we aim to build a chatbot for mental health support. 2. The prototype dialogue model for mental health support will integrate rule-based functions with learning-based components, so that it covers all user input while remaining controllable. 3. Interactive dialogue data between users and the system will be collected through the designed dialogue system, then sorted and annotated in cooperation with a professional psychological counseling team. 4. Unlike many previous works, the dialogue system accepts both voice and text as input; voice input gives the system more flexibility across application scenarios. 5. Virtual characters will be added to the user interface to make interacting with the chatbot more realistic and fun, which should increase users' willingness to participate. 6. The system will be robust to noisy and diverse input: by training the natural language understanding (NLU) module with a large number of adversarial examples, the dialogue manager can make correct decisions in complex situations. 7. We will present another perspective on the problem of posterior collapse in natural sentence generation: by adjusting the specification of the flow model, we can set the KL divergence between the prior and the posterior, which provides greater flexibility than previous works. 8. The dialogue system is bilingual: it can operate in English and Chinese using data from the MultiWOZ and CrossWOZ datasets. 9. We will collect relevant dialogue data for subsequent training of the model.
Dataset quality is one of the important factors affecting the performance of deep learning models, so we refer to the earlier paper "Sugariness prediction of Syzygium samarangense using convolutional learning of hyperspectral images", published in the top journal Scientific Reports, which uses deep learning to analyze the spectra of hyperspectral images of fruit and predicts the fruit's Brix value through regression analysis. During collection, the fruits are first pre-treated by returning them to temperature, then sliced, and hyperspectral data are obtained using a hyperspectral instrument. These data collection and processing procedures will be adopted when creating the dataset for this project. |
||
---|---|---|---|
Keyword | Dialogue system, Automatic speech recognition, Natural language understanding, Natural language generation, Text to speech | ||
Research Project | Data Representation and Learning for Dialogue System | ||
Research Team | Led by PI: Prof. Jen-Tzung Chien, National Yang Ming Chiao Tung University; Co-PI: Prof. Jen-Tzung Chien, National Yang Ming Chiao Tung University |
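
Point 2 of the summary describes combining rule-based functions with learning-based components so that every user input is covered while responses stay controllable. One way this kind of routing could look is sketched below. This is only an illustration under stated assumptions, not the project's implementation: the rule table, the canned replies, and the `learned_reply` placeholder are all hypothetical, and a real system would substitute a trained generation model for the placeholder.

```python
import re
from typing import Callable, List, Optional, Tuple

# Hypothetical rule table (pattern, canned reply); not taken from the project.
RULES: List[Tuple[re.Pattern, str]] = [
    (re.compile(r"\b(hello|hi)\b", re.IGNORECASE),
     "Hello! How are you feeling today?"),
    (re.compile(r"\b(exam|deadline)s?\b", re.IGNORECASE),
     "Exams and deadlines can be stressful. What is weighing on you most?"),
]


def rule_based_reply(text: str) -> Optional[str]:
    """Return the reply of the first matching rule, or None if no rule fires."""
    for pattern, reply in RULES:
        if pattern.search(text):
            return reply
    return None


def learned_reply(text: str) -> str:
    """Placeholder for a learning-based component (assumption: any trained model)."""
    return "I hear you. Could you tell me more about that?"


def respond(text: str, fallback: Callable[[str], str] = learned_reply) -> str:
    """Prefer the controllable rule-based path; fall back to the learned model."""
    reply = rule_based_reply(text)
    return reply if reply is not None else fallback(text)
```

Under this design the rules give predictable behavior on the inputs they cover, and the fallback ensures that no input goes unanswered. For example, `respond("Hi there")` takes the rule-based path, while `respond("I can't sleep lately")` matches no rule and is passed to the fallback.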