การลดสัญญาณรบกวนเติมแต่งแบบคงที่ก่อนการรู้จำเสียงพูด โดยการลบสเปกตรัมแบบปรับค่าถ่วงน้ำหนักได้

เลอเกียรติ ชววิทยากุล

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/12955

Title:	การลดสัญญาณรบกวนเติมแต่งแบบคงที่ก่อนการรู้จำเสียงพูด โดยการลบสเปกตรัมแบบปรับค่าถ่วงน้ำหนักได้
Other Titles:	Additive stationary noise subtraction for speech recognition using adjustable weight spectral subtraction
Authors:	เลอเกียรติ ชววิทยากุล
Advisors:	อติวงศ์ สุชาโต โปรดปราน บุณยพุกกณะ
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. คณะวิศวกรรมศาสตร์
Advisor's Email:	[email protected] ไม่มีข้อมูล
Subjects:	การรู้จำเสียงพูดอัตโนมัติ สัญญาณรบกวนทางไฟฟ้า
Issue Date:	2549
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	ในระบบรู้จำเสียงพูดนั้น หากมีสัญญาณรบกวนรวมอยู่ในสัญญาณเสียง จะมีผลทำให้ความแม่นยำและประสิทธิภาพในการรู้จำลดลง ทั้งนี้ได้มีงานวิจัยที่ศึกษาขั้นตอนวิธีต่างๆ อย่างกว้างขวาง โดยมีวัตถุประสงค์เพื่อลดสัญญาณรบกวนให้มากที่สุดก่อนผ่านเข้าสู่กระบวนการรู้จำต่อไป ในวิทยานิพนธ์นี้ได้นำเสนอขั้นตอนวิธีการลบสเปกตรัมแบบปรับค่าถ่วงน้ำหนักได้ โดยใช้การประมาณขนาดสัญญาณรบกวนจากสัญญาณเสียงพูดในช่วง 0-100 มิลลิวินาทีแรก ซึ่งถือว่าเป็นช่วงสัญญาณเงียบเพื่อให้เป็นตัวแปรเสริมในการเลือกค่าถ่วงน้ำหนักที่เหมาะสม งานวิจัยนี้นำเสนอการใช้ฟังก์ชันความสัมพันธ์ระหว่างค่าถ่วงน้ำหนัก และค่าเฉลี่ยพลังงานสัญญาณรบกวนแบบเชิงเส้น ซึ่งสามารถเพิ่มความแม่นยำในการรู้จำเสียงพูดได้ดี ในกรณีค่าอัตราส่วนสัญญาณต่อสัญญาณรบกวนมีค่าต่ำ ผลการทดลองแสดงให้เห็นว่าค่าความแม่นยำในการรู้จำมีค่ามากขึ้น เมื่อใช้กับขั้นตอนวิธีที่นำเสนอนี้ โดยเมื่อเปรียบเทียบขั้นตอนวิธีนำเสนอและขั้นตอนวิธีการลบสเปกตรัมแบบการประมาณแม็กซิมัมไลค์ลิฮูค พบว่า ค่าความแม่นยำโดยรวมเมื่อใช้ขั้นตอนวิธีที่นำเสนอดีขึ้น 17.79% เมื่อใช้ชุดข้อมูลเสียงพูดตัวเลข
Other Abstract:	Impact from noisy signal degrades accuracy in speech recognition system. Noise reduction techniques have been developed continuously in order to reduce noise signal from desired speech before it is passed to the recognizer. In this thesis, an adaptive weight spectral subtraction is proposed. The method estimates noise from the first 100 milliseconds of the original speech and threats it as background noise. Then a linear function of weight and the average noise energy is calculated. This method is efficient with speech that SNR is low, Experiments show that the proposed method increases recognition accuracy by 17.79% when compared with maximum likelihood spectral subtraction and tested in single digit spoken speech data.
Description:	วิทยานิพนธ์ (วท.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2549
Degree Name:	วิทยาศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	วิทยาศาสตร์คอมพิวเตอร์
URI:	http://cuir.car.chula.ac.th/handle/123456789/12955
URI:	http://doi.org/10.14457/CU.the.2006.468
metadata.dc.identifier.DOI:	10.14457/CU.the.2006.468
Type:	Thesis
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
Lerkiat_ch.pdf		683.72 kB	Adobe PDF	View/Open

Show full item record