Skip to content

Commit

Permalink
Update researchtheme.md
Browse files Browse the repository at this point in the history
  • Loading branch information
bagustris authored May 12, 2022
1 parent 67a75c9 commit 3dae782
Showing 1 changed file with 13 additions and 5 deletions.
18 changes: 13 additions & 5 deletions researchtheme.md
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
## Research Theme

My research goals aimed at extracting knowledge from acoustic information (aka acoustic features). The examples of this theme are: speech emotion recognition, abnormal sound detection, and audio classication. Moreover, the research can be extended to vibration signal. My approaches to achieve these goal are defined by: (1) data-drive approach (instead of physical modeling), (2) focus on practical implemention (not necessary to follow human mechanism), and robustness (how stable the model given any perturbation instead of correctness). For me, science should be evidence-based, able to be implemented and consistent. My reserach contributes to developing technologies to solve issues in Society 5.0 (What is Society 5.0? [Read here in Indonesian language](http://bagustris.blogspot.com/2022/04/menuju-masyarakat-50-melalui-riset-dan.html)).
My research goals aimed at extracting knowledge from acoustic information (aka acoustic features). The examples of this theme are: speech emotion recognition, abnormal sound detection, and audio classication. Moreover, the research can be extended to vibration signal. My approaches to achieve these goal are defined by: (1) data-driven approach (instead of physical modeling), (2) focus on practical implemention (not necessary to follow human mechanism), and robustness (how stable/consistent the model given any perturbation instead of correctness). For me, science should be evidence-based, able to be implemented and consistent. My research is result-oriented instead of process-oriented. It doesn't mean that process (physical phenomena, modeling, math, and algorithms) is not important. If we understand the process very well, the solution may appears by itself. However, I judge my research mainly based on the results. My research contributes to developing technologies to solve issues in Society 5.0 (What is Society 5.0? [Read here in Indonesian language](http://bagustris.blogspot.com/2022/04/menuju-masyarakat-50-melalui-riset-dan.html)).

![research_concept](images/research_concept.png)

The following is research theme that I offered, particularly (but not limited) for Enginenering Physics students ITS.
*Berikut ini adalah tema riset, khususnya judul-judul TA yang saya tawarkan kepada mahasiswa Departemen Teknik Fisika ITS.*
The following is research theme that I offered, particularly (but NOT limited) for Enginenering Physics students ITS.
<!--- *Berikut ini adalah tema riset, khususnya judul-judul TA yang saya tawarkan kepada mahasiswa Departemen Teknik Fisika ITS.* --->

For undergraduate level, I will try to provide the baseline method, and you will improve the results using the proposed method.
For undergraduate level, I will try to provide the baseline method, and you will improve the results using your proposed method.

1. Speech emotion recognition using multilayer perceptron with CCC loss, dataset: IEMOCAP
2. Indonesian speech recognition using Wav2Vec2/Hubert/WavLM/UniSpeech-SAT, etc.
Expand All @@ -19,7 +19,15 @@ For undergraduate level, I will try to provide the baseline method, and you will
8. Indonesian emotional Speech synthesis Using FastSpeech
~~9. COVID-19 diagnosis using COUGH sound with deep learning~~
10. COVID-19 diagnosis using SPEECH sound with deep learning, dataset: ComPare CSS 2021
11. Predicting Pathological voice disorder with speech processing technique, dataset: SVD, Voiced, HUPA
11. Predicting pathological voice disorder with speech processing technique, dataset: SVD, Voiced, HUPA
12. Detecting of emotion intensity of non-speech sound (laughter, crying, etc.)
13. Detecting/predicting stuttering (bahasa: gagap) in speech with machine learning
14. Predicting the intensities of seven self-reported emotions (Adoration, Amusement, Anxiety, Disgust, Empathic Pain, Fear, Surprise) from user-generated reactions to emotionally evocative videos.
15. Few-shot learning on acoustic data to capture 10 dimensions of emotion reliably perceived in distinct vocal bursts: Awe, Excitement, Amusement, Awkwardness, Fear, Horror, Distress, Triumph, Sadness and Surprise.
16. Multimodal learning (audio+video+text) to capture 10 dimensions of emotion reliably perceived in distinct vocal bursts: Awe, Excitement, Amusement, Awkwardness, Fear, Horror, Distress, Triumph, Sadness and Surprise.
17. Inferring self-reported emotion from multimodal expression, using multi-output regression to predict fine-grained self-report annotations of seven ‘in-the-wild' emotional experiences.



## Other topics/themes:
Read my [papers](https://scholar.google.co.jp/citations?user=xuiLAewAAAAJ&hl=en). Usually, I wrote down the remaining tasks for future work in that topic.
Expand Down

0 comments on commit 3dae782

Please sign in to comment.