I. Patents
2022
ID | Patent Titles | authors | link |
---|---|---|---|
CN-114945892-A | 播放音频的方法、装置、系统、设备及存储介质 | 曹翔, 汤戈, 徐豪杰, 王征韬, 雷兆恒 | https://patents.google.com/patent/CN114945892A/zh |
CN-114936996-A | 一种图像检测方法、装置、智能设备及存储介质 | 洪国伟, 曹成志, 董治, 雷兆恒 | https://patents.google.com/patent/CN114936996A/zh |
CN-114329043-A | Audio essence fragment determination method, electronic equipment and computer-readable storage medium | 毛绮雯, 陈肇康, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN114329043A/en |
CN-114067840-A | 生成音乐视频的方法、存储介质和电子设备 | 梅立锋, 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN114067840A/zh |
CN-113963397-A | 图像处理方法、服务器以及存储介质 | 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN113963397A/zh |
CN-113902989-A | 直播场景检测方法、存储介质及电子设备 | 洪国伟, 曹成志, 曾裕斌, 董治, 雷兆恒 | https://patents.google.com/patent/CN113902989A/zh |
CN-113377331-B | Audio data processing method, device, equipment and storage medium | 余菲, 孔令城, 赵伟峰, 雷兆恒, 周文江 | https://patents.google.com/patent/CN113377331B/en |
CN-113393830-B | 混合声学模型训练及歌词时间戳生成方法、设备、介质 | 张斌, 赵伟峰, 雷兆恒, 周文江, 张柏生, 李幸烨, 苑文波, 杨小康, 李童, 林艳秋, 曹利, 代玥, 胡鹏 | https://patents.google.com/patent/CN113393830B/zh |
CN-113901894-A | Video generation method, device, server and storage medium | 杨跃, 董治, 雷兆恒, 梅立锋 | https://patents.google.com/patent/CN113901894A/en |
CN-113888534-A | 一种图像处理方法、电子设备及可读存储介质 | 曾梓华, 董治, 雷兆恒 | https://patents.google.com/patent/CN113888534A/zh |
2021
id | patent title | authors | link |
---|---|---|---|
CN-113724136-A | Video restoration method, device and medium | 曾裕斌, 洪国伟, 董治, 雷兆恒 | https://patents.google.com/patent/CN113724136A/en |
CN-113689440-A | Video processing method and device, computer equipment and storage medium | 黄均昕, 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN113689440A/en |
CN-113610012-A | Video detection method, electronic device and computer-readable storage medium | 洪国伟, 曹成志, 曾裕斌, 董治, 雷兆恒 | https://patents.google.com/patent/CN113610012A/en |
CN-113569809-A | Image processing method, device and computer readable storage medium | 魏旭东, 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN113569809A/en |
CN-113516762-A | Image processing method and device | 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN113516762A/en |
CN-113505707-A | 吸烟行为检测方法、电子设备及可读存储介质 | 洪国伟, 曹成志, 雷兆恒 | https://patents.google.com/patent/CN113505707A/zh |
CN-113868463-A | Recommendation model training method and device | 龚韬, 赵伟峰, 胡诗超, 陈洲旋, 顾旻玮, 马小栓, 蔡宗颔, 雷兆恒, 周文江 | https://patents.google.com/patent/CN113868463A/en |
CN-113486672-A | Method for disambiguating polyphone, electronic device and computer readable storage medium | 杨宜涛, 徐东, 陈洲旋, 赵伟峰, 雷兆恒, 周文江 | https://patents.google.com/patent/CN113486672A/en |
CN-113473201-A | Audio and video alignment method, device, equipment and storage medium | 杨跃, 董治, 雷兆恒 | https://patents.google.com/patent/CN113473201A/en |
CN-113393830-A | 混合声学模型训练及歌词时间戳生成方法、设备、介质 | 张斌, 赵伟峰, 雷兆恒, 周文江, 张柏生, 李幸烨, 苑文波, 杨小康, 李童, 林艳秋, 曹利, 代玥, 胡鹏 | https://patents.google.com/patent/CN113393830A/zh |
CN-113377331-A | 一种音频数据处理方法、装置、设备及存储介质 | 余菲, 孔令城, 赵伟峰, 雷兆恒, 周文江 | https://patents.google.com/patent/CN113377331A/zh |
CN-113257222-A | Method, terminal and storage medium for synthesizing song audio | 周思瑜, 庄晓滨, 徐东, 赵伟峰, 吴斌, 雷兆恒, 胡鹏 | https://patents.google.com/patent/CN113257222A/en |
CN-113192484-A | 基于文本生成音频的方法、设备和存储介质 | 徐东, 邓一平, 陈洲旋, 鲁霄, 余洋洋, 陈苑苑, 邢佳佳, 陈纳珩, 周思瑜, 赵伟峰, 周蓝珺, 易越, 许瑶, 唐志彬, 曹利, 雷兆恒, 潘树燊, 周文江 | https://patents.google.com/patent/CN113192484A/zh |
WO-2021139535-A1 | Method, apparatus and system for playing audio, and device and storage medium | 曹翔, 汤戈, 徐豪杰, 王征韬, 雷兆恒 | https://patents.google.com/patent/WO2021139535A1/en |
CN-113077815-A | 一种音频评估方法及组件 | 夏志强, 吴斌, 雷兆恒, 王征韬 | https://patents.google.com/patent/CN113077815A/zh |
CN-109903784-B | 一种拟合失真音频数据的方法及装置 | 陈颖, 赵伟峰, 张庆, 雷兆恒, 王征韬, 孔令城, 徐东, 杨伟明, 陈洲旋, 鲁霄 | https://patents.google.com/patent/CN109903784B/zh |
CN-112445933-A | Model training method, device, equipment and storage medium | 陈肇康, 林梅露, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN112445933A/en |
CN-112257781-A | Model training method and device | 林梅露, 陈肇康, 夏志强, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN112257781A/en |
CN-112231511-A | Neural network model training method and song mining method and device | 夏志强, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN112231511A/en |
CN-112183946-A | Multimedia content evaluation method, device and training method thereof | 关文婕, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN112183946A/en |
2020
id | patent title | authors | link |
---|---|---|---|
CN-111414513-A | Music genre classification method and device and storage medium | 林梅露, 吴康健, 吴斌, 王征韬, 夏志强, 雷兆恒 | https://patents.google.com/patent/CN111414513A/en |
CN-111261185-A | 播放音频的方法、装置、系统、设备及存储介质 | 曹翔, 汤戈, 徐豪杰, 王征韬, 雷兆恒 | https://patents.google.com/patent/CN111261185A/zh |
2019
ID | Patent Title | Autors | Link |
---|---|---|---|
CN-110516104-A | Song recommendations method, apparatus and computer storage medium | 张斌, 王征韬, 吴斌, 雷兆恒 | https://patents.google.com/patent/CN110516104A/en |
CN-110472096-A | Management method, device, equipment and the storage medium of library | 杨伟明, 赵伟峰, 雷兆恒 | https://patents.google.com/patent/CN110472096A/en |
CN-109903784-A | A kind of method and device of fitting distortion audio data | 陈颖, 赵伟峰, 张庆, 雷兆恒, 王征韬, 孔令城, 徐东, 杨伟明, 陈洲旋, 鲁霄 | https://patents.google.com/patent/CN109903784A/en |
II. Publications
2023
Ju, Y., Xu, C., Guo, Y., Li, J., & Lui, S. (2023). Improving Automatic Singing Skill Evaluation with Timbral Features, Attention, and Singing Voice Separation. ICME 2023.
2022
Xu, L., Wang, Z., Wu, B., & Lui, S. (2022). MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis. Accepted in CVPR 2022
Zhang, B., Wang, W., Zhao, E., & Lui, S. (2022). Lyrics-to-audio alignment for dynamic lyric generation. Music Inf. Retrieval Eval. eXchange Audio-Lyrics Alignment Challenge.
2021
Zhuang, X., Yu, H., Zhao, W., Jiang, T., Hu, P., Lui, S., & Zhou, W. (2021). KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke. arXiv preprint arXiv:2110.09121.
S. Hu, B. Liang, Z. Chen, X. Lu, E. Zhao and S. Lui, “Large-scale singer recognition using deep metric learning: an experimental study,” 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-6, doi: 10.1109/IJCNN52387.2021.9533911.
Zhuang, X., Jiang, T., Chou, S. Y., Wu, B., Hu, P., & Lui, S. (2021, June). Litesing: Towards fast, lightweight and expressive singing voice synthesis. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7078-7082). IEEE.
Zeng, Y., Xiao, Z., Hung, K. W., & Lui, S. (2021). Real-time video super resolution network using recurrent multi-branch dilated convolutions. Signal Processing: Image Communication, 93, 116167.
Xiao, Z., Zhang, Z., Hung, K. W., & Lui, S. (2021). Real-time video super-resolution using lightweight depthwise separable group convolutions with channel shuffling. Journal of Visual Communication and Image Representation, 75, 103038.
2020
Hu, S., Zhang, B., Liang, B., Zhao, E., & Lui, S. (2020). Phase-aware music super-resolution using generative adversarial networks. arXiv preprint arXiv:2010.04506. Interspeech 2020.
Jin, C., Wang, T., Liu, S., Tie, Y., Li, J., Li, X., & Lui, S. (2020). A transformer-based model for multi-track music generation. International Journal of Multimedia Data Engineering and Management (IJMDEM), 11(3), 36-54.
Lin, K. W. E., Balamurali, B. T., Koh, E., Lui, S., & Herremans, D. (2020). Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy. Neural Computing and Applications, 32(4), 1037-1050.
2019
Agres, K., Lui, S., & Herremans, D. (2019, August). A novel music-based game with motion capture to support cognitive and motor function in the elderly. In 2019 IEEE Conference on Games (CoG) (pp. 1-4). IEEE.
Balamurali, B. T., Lin, K. E., Lui, S., Chen, J. M., & Herremans, D. (2019). Toward robust audio spoofing detection: A detailed comparison of traditional and learned features. IEEE Access, 7, 84229-84241.
Zhao, D., Lee, J. S. A., Tan, C. T., Dancu, A., Lui, S., Shen, S., & Mueller, F. F. (2019, June). GameLight-Gamification of the Outdoor Cycling Experience. In Companion Publication of the 2019 on Designing Interactive Systems Conference 2019 Companion (pp. 73-76).
Hee, H. I., Balamurali, B. T., Karunakaran, A., Herremans, D., Teoh, O. H., Lee, K. P., … & Chen, J. M. (2019). Development of machine learning for asthmatic and healthy voluntary cough sounds: A proof of concept study. Applied Sciences, 9(14), 2833.
2018
Agus, N., Anderson, H., Chen, J. M., Lui, S., & Herremans, D. (2018). Minimally simple binaural room modeling using a single feedback delay network. Journal of the Audio Engineering Society, 66(10), 791-807.
Agus, N., Anderson, H., Chen, J. M., Lui, S., & Herremans, D. (2018). Perceptual evaluation of measures of spectral variance. The Journal of the Acoustical Society of America, 143(6), 3300-3311.
Upadhyay, R., & Lui, S. (2018, January). Foreign English accent classification using deep belief networks. In 2018 IEEE 12th international conference on semantic computing (ICSC) (pp. 290-293). IEEE.
2017
Anderson, H., Agus, N., Chen, J. M., & Lui, S. (2017). Modeling the Proportion of Early and Late Energy in Two-Stage Reverberators. Journal of the Audio Engineering Society, 65(12), 1017-1031.
Lui, S., & Grunberg, D. (2017, December). Using skin conductance to evaluate the effect of music silence to relieve and intensify arousal. In 2017 international conference on orange technologies (ICOT) (pp. 91-94). IEEE.
Fang, J., Grunberg, D., Lui, S., & Wang, Y. (2017, December). Development of a music recommendation system for motivating exercise. In 2017 International Conference on Orange Technologies (ICOT) (pp. 83-86). IEEE.
Hee, H. I., Chen, J., & Lui, S. (2017). Intuitive Interactive Platform for Preoperative Communication Between Hospital and Patients/Caregivers: Towards Community Partnership for Peri-Operative Person-Based Healthcare Model. Iproceedings, 3(1), e8425.
Agus, N., Anderson, H., Chen, J. M., & Lui, S. (2017). Energy-Based Binaural Acoustic Modeling. Technical Report 1, Singapore University of Technology and Design.(2017 Apr.) https://istd. sutd. edu. sg/research/technicalreports/energy-based-binaural-acoustic-modeling.
Lin, K. W. E., Anderson, H., So, C., & Lui, S. (2017). Sinusoidal Partials Tracking for Singing Analysis Using the Heuristic of the Minimal Frequency and Magnitude Difference. In INTERSPEECH (pp. 3038-3042).
2016
Khwaja, M. K., Vikash, P., Arulmozhivarman, P., & Lui, S. (2016). Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model. International Journal of Speech Technology, 19(4), 895-905.
Lee, H., Yoong, A. C. H., Lui, S., Vaniyar, A., & Balasubramanian, G. (2016, November). Design exploration for the” squeezable” interaction. In Proceedings of the 28th Australian Conference on Computer-Human Interaction (pp. 586-594).
2015
Tan, C. T., Byrne, R., Lui, S., Liu, W., & Mueller, F. (2015). JoggAR: a mixed-modality AR approach for technology-augmented jogging. In SIGGRAPH Asia 2015 Mobile Graphics and Interactive Applications (pp. 1-1).
Anderson, H., Lin, K. W. E., So, C., & Lui, S. (2015, October). Flatter frequency response from feedback delay network reverbs. In ICMC.
Trochidis, K., & Lui, S. (2015, June). Modeling affective responses to music using audio signal analysis and physiology. In International symposium on computer music multidisciplinary research (pp. 346-357). Springer, Cham.
Anderson, H., Lin, K. W. E., Agus, N., & Lui, S. (2015, May). Major thirds: a better way to tune your ipad. In NIME (pp. 365-368).
Leslie, G., Picard, R., & Lui, S. (2015). An EEG and Motion Capture Based Expressive Music Interface for Affective Neurofeedback. In Proc. 1st Int. BCMI Workshop.
Lui, S. (2015, May). Generate expressive music from picture with a handmade multi-touch music table. In NIME (pp. 374-377).
Hoon, L. T., Vuyyuru, M. R., Kumar, T. A., & Lui, S. (2015). Binaural Navigation for the Visually Impaired with a Smartphone. In ICMC.