{"id":27104,"date":"2025-04-18T10:52:05","date_gmt":"2025-04-18T03:52:05","guid":{"rendered":"https:\/\/interdata.vn\/blog\/?p=27104"},"modified":"2025-04-21T11:34:29","modified_gmt":"2025-04-21T04:34:29","slug":"feature-selection-la-gi","status":"publish","type":"post","link":"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/","title":{"rendered":"Feature Selection l\u00e0 g\u00ec? A-Z v\u1ec1 l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng trong ML"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-hierarchy ez-toc-counter ez-toc-white ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">N\u1ed8I DUNG<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Content\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 eztoc-toggle-hide-by-default' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Feature-Selection-la-gi\" >Feature Selection l\u00e0 g\u00ec?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Cac-phuong-phap-Feature-Selection-co-giam-sat\" >C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection c\u00f3 gi\u00e1m s\u00e1t<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Phuong-phap-loc\" >Ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Phuong-phap-bao-boc\" >Ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Phuong-phap-nhung\" >Ph\u01b0\u01a1ng ph\u00e1p nh\u00fang<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Cac-phuong-phap-Feature-Selection-khong-giam-sat\" >C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection kh\u00f4ng gi\u00e1m s\u00e1t<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Loi-ich-khi-su-dung-Feature-Selection\" >L\u1ee3i \u00edch khi s\u1eed d\u1ee5ng Feature Selection<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Don-gian-hoa-mo-hinh-va-tang-kha-nang-dien-giai\" >\u0110\u01a1n gi\u1ea3n h\u00f3a m\u00f4 h\u00ecnh v\u00e0 t\u0103ng kh\u1ea3 n\u0103ng di\u1ec5n gi\u1ea3i<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Cai-thien-hieu-suat-du-doan\" >C\u1ea3i thi\u1ec7n hi\u1ec7u su\u1ea5t d\u1ef1 \u0111o\u00e1n<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-10\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Giam-thoi-gian-huan-luyen-va-chi-phi-tinh-toan\" >Gi\u1ea3m th\u1eddi gian hu\u1ea5n luy\u1ec7n v\u00e0 chi ph\u00ed t\u00ednh to\u00e1n<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-11\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Giam-nguy-co-qua-khop-Overfitting\" >Gi\u1ea3m nguy c\u01a1 qu\u00e1 kh\u1edbp (Overfitting)<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-12\" href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/#Cach-chon-phuong-phap-Feature-Selection\" >C\u00e1ch ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p Feature Selection<\/a><\/li><\/ul><\/nav><\/div>\n<p>Trong qu\u00e1 tr\u00ecnh x\u00e2y d\u1ef1ng m\u00f4 h\u00ecnh <a href=\"https:\/\/interdata.vn\/blog\/machine-learning-la-gi\/\">Machine Learning<\/a> (H\u1ecdc m\u00e1y), vi\u1ec7c l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng (Feature Selection) \u0111\u00f3ng vai tr\u00f2 v\u00f4 c\u00f9ng quan tr\u1ecdng gi\u00fap c\u1ea3i thi\u1ec7n hi\u1ec7u su\u1ea5t v\u00e0 \u0111\u1ed9 ch\u00ednh x\u00e1c c\u1ee7a m\u00f4 h\u00ecnh. Feature Selection gi\u00fap gi\u1ea3m b\u1edbt s\u1ed1 l\u01b0\u1ee3ng \u0111\u1eb7c tr\u01b0ng \u0111\u1ea7u v\u00e0o, lo\u1ea1i b\u1ecf nh\u1eefng \u0111\u1eb7c tr\u01b0ng kh\u00f4ng li\u00ean quan ho\u1eb7c d\u01b0 th\u1eeba, t\u1eeb \u0111\u00f3 gi\u00fap m\u00f4 h\u00ecnh tr\u1edf n\u00ean g\u1ecdn nh\u1eb9 v\u00e0 d\u1ec5 hi\u1ec3u h\u01a1n.<\/p>\n<p>B\u00e0i vi\u1ebft n\u00e0y s\u1ebd \u0111i s\u00e2u v\u00e0o t\u00ecm hi\u1ec3u<a href=\"https:\/\/interdata.vn\/blog\/feature-selection-la-gi\/\"><strong> Feature Selection l\u00e0 g\u00ec<\/strong><\/a>, t\u00ecm hi\u1ec3u c\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection c\u00f3 gi\u00e1m s\u00e1t v\u00e0 kh\u00f4ng gi\u00e1m s\u00e1t, c\u00f9ng nh\u1eefng \u01b0u \u0111i\u1ec3m khi s\u1eed d\u1ee5ng ph\u01b0\u01a1ng ph\u00e1p n\u00e0y trong quy tr\u00ecnh h\u1ecdc m\u00e1y. \u0110\u1ecdc ngay!<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Feature-Selection-la-gi\"><\/span><strong>Feature Selection l\u00e0 g\u00ec?<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><strong>Feature Selection (hay L\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng, Ch\u1ecdn l\u1ecdc \u0111\u1eb7c tr\u01b0ng) l\u00e0 qu\u00e1 tr\u00ecnh t\u1ef1 \u0111\u1ed9ng ho\u1eb7c th\u1ee7 c\u00f4ng nh\u1eb1m ch\u1ecdn ra m\u1ed9t t\u1eadp h\u1ee3p con (subset) c\u00e1c \u0111\u1eb7c tr\u01b0ng (features) quan tr\u1ecdng v\u00e0 ph\u00f9 h\u1ee3p nh\u1ea5t t\u1eeb t\u1eadp d\u1eef li\u1ec7u g\u1ed1c<\/strong> \u0111\u1ec3 x\u00e2y d\u1ef1ng m\u00f4 h\u00ecnh Machine Learning (H\u1ecdc m\u00e1y) hi\u1ec7u qu\u1ea3.<\/p>\n<figure id=\"attachment_27109\" aria-describedby=\"caption-attachment-27109\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Feature-Selection-la-gi.jpg\" alt=\"Feature Selection l\u00e0 g\u00ec?\" width=\"800\" height=\"419\" class=\"size-full wp-image-27109\" title=\"\" srcset=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Feature-Selection-la-gi.jpg 800w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Feature-Selection-la-gi-300x157.jpg 300w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Feature-Selection-la-gi-768x402.jpg 768w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Feature-Selection-la-gi-750x393.jpg 750w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-27109\" class=\"wp-caption-text\">Feature Selection l\u00e0 g\u00ec?<\/figcaption><\/figure>\n<p>N\u00f3 kh\u00f4ng t\u1ea1o ra \u0111\u1eb7c tr\u01b0ng m\u1edbi hay bi\u1ebfn \u0111\u1ed5i \u0111\u1eb7c tr\u01b0ng hi\u1ec7n c\u00f3. Thay v\u00e0o \u0111\u00f3, Feature Selection t\u1eadp trung v\u00e0o vi\u1ec7c gi\u1ea3m s\u1ed1 l\u01b0\u1ee3ng \u0111\u1eb7c tr\u01b0ng \u0111\u1ea7u v\u00e0o b\u1eb1ng c\u00e1ch x\u00e1c \u0111\u1ecbnh v\u00e0 lo\u1ea1i b\u1ecf nh\u1eefng \u0111\u1eb7c tr\u01b0ng kh\u00f4ng li\u00ean quan (irrelevant) ho\u1eb7c d\u01b0 th\u1eeba (redundant), ch\u1ec9 gi\u1eef l\u1ea1i nh\u1eefng th\u00f4ng tin c\u1ed1t l\u00f5i.<\/p>\n<p>H\u00e3y t\u01b0\u1edfng t\u01b0\u1ee3ng b\u1ea1n c\u00f3 m\u1ed9t b\u1ed9 s\u01b0u t\u1eadp c\u00f4ng c\u1ee5 l\u1edbn v\u1edbi nhi\u1ec1u m\u00f3n \u0111\u1ed3. Feature Selection gi\u1ed1ng nh\u01b0 vi\u1ec7c b\u1ea1n ch\u1ec9 ch\u1ecdn ra nh\u1eefng c\u00f4ng c\u1ee5 th\u1ef1c s\u1ef1 c\u1ea7n thi\u1ebft v\u00e0 h\u1eefu \u00edch nh\u1ea5t cho c\u00f4ng vi\u1ec7c c\u1ee5 th\u1ec3 s\u1eafp t\u1edbi, gi\u00fap b\u1ea1n l\u00e0m vi\u1ec7c hi\u1ec7u qu\u1ea3 h\u01a1n.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Cac-phuong-phap-Feature-Selection-co-giam-sat\"><\/span><strong>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection c\u00f3 gi\u00e1m s\u00e1t<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>L\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng c\u00f3 gi\u00e1m s\u00e1t s\u1eed d\u1ee5ng bi\u1ebfn m\u1ee5c ti\u00eau \u0111\u1ec3 x\u00e1c \u0111\u1ecbnh c\u00e1c \u0111\u1eb7c tr\u01b0ng quan tr\u1ecdng nh\u1ea5t. V\u00ec c\u00e1c \u0111\u1eb7c tr\u01b0ng d\u1eef li\u1ec7u \u0111\u00e3 \u0111\u01b0\u1ee3c x\u00e1c \u0111\u1ecbnh, nhi\u1ec7m v\u1ee5 l\u00e0 x\u00e1c \u0111\u1ecbnh bi\u1ebfn \u0111\u1ea7u v\u00e0o n\u00e0o \u1ea3nh h\u01b0\u1edfng tr\u1ef1c ti\u1ebfp nh\u1ea5t \u0111\u1ebfn bi\u1ebfn m\u1ee5c ti\u00eau. T\u01b0\u01a1ng quan l\u00e0 ti\u00eau ch\u00ed ch\u00ednh khi \u0111\u00e1nh gi\u00e1 c\u00e1c \u0111\u1eb7c tr\u01b0ng quan tr\u1ecdng nh\u1ea5t.<\/p>\n<p>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng c\u00f3 gi\u00e1m s\u00e1t bao g\u1ed3m:<\/p>\n<ul>\n<li><strong>Ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc<\/strong><\/li>\n<li><strong>Ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc<\/strong><\/li>\n<li><strong>Ph\u01b0\u01a1ng ph\u00e1p nh\u00fang<\/strong><\/li>\n<li><strong>Ph\u01b0\u01a1ng ph\u00e1p k\u1ebft h\u1ee3p<\/strong>, k\u1ebft h\u1ee3p hai ho\u1eb7c nhi\u1ec1u ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng c\u00f3 gi\u00e1m s\u00e1t.<\/li>\n<\/ul>\n<h3><span class=\"ez-toc-section\" id=\"Phuong-phap-loc\"><\/span><strong>Ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc l\u00e0 nh\u00f3m c\u00e1c k\u1ef9 thu\u1eadt l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng ch\u1ec9 quan t\u00e2m \u0111\u1ebfn d\u1eef li\u1ec7u v\u00e0 kh\u00f4ng tr\u1ef1c ti\u1ebfp xem x\u00e9t t\u1ed1i \u01b0u h\u00f3a hi\u1ec7u su\u1ea5t m\u00f4 h\u00ecnh. C\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o \u0111\u01b0\u1ee3c \u0111\u00e1nh gi\u00e1 \u0111\u1ed9c l\u1eadp so v\u1edbi bi\u1ebfn m\u1ee5c ti\u00eau \u0111\u1ec3 x\u00e1c \u0111\u1ecbnh bi\u1ebfn n\u00e0o c\u00f3 t\u01b0\u01a1ng quan cao nh\u1ea5t. C\u00e1c ph\u01b0\u01a1ng ph\u00e1p ki\u1ec3m tra t\u1eebng \u0111\u1eb7c tr\u01b0ng m\u1ed9t \u0111\u01b0\u1ee3c g\u1ecdi l\u00e0 ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng \u0111\u01a1n bi\u1ebfn.<\/p>\n<p>Th\u01b0\u1eddng \u0111\u01b0\u1ee3c s\u1eed d\u1ee5ng nh\u01b0 m\u1ed9t c\u00f4ng c\u1ee5 ti\u1ec1n <a href=\"https:\/\/interdata.vn\/blog\/data-preprocessing-la-gi\/\">x\u1eed l\u00fd d\u1eef li\u1ec7u<\/a>, ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc l\u00e0 c\u00e1c <a href=\"https:\/\/interdata.vn\/blog\/thuat-toan-algorithm\/\">thu\u1eadt to\u00e1n<\/a> l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng nhanh ch\u00f3ng v\u00e0 hi\u1ec7u qu\u1ea3, xu\u1ea5t s\u1eafc trong vi\u1ec7c gi\u1ea3m s\u1ef1 d\u01b0 th\u1eeba v\u00e0 lo\u1ea1i b\u1ecf c\u00e1c \u0111\u1eb7c tr\u01b0ng kh\u00f4ng li\u00ean quan kh\u1ecfi b\u1ed9 d\u1eef li\u1ec7u. C\u00e1c ph\u00e9p ki\u1ec3m tra th\u1ed1ng k\u00ea kh\u00e1c nhau \u0111\u01b0\u1ee3c s\u1eed d\u1ee5ng \u0111\u1ec3 t\u00ednh \u0111i\u1ec3m cho m\u1ed7i bi\u1ebfn \u0111\u1ea7u v\u00e0o v\u1ec1 t\u01b0\u01a1ng quan. Tuy nhi\u00ean, c\u00e1c ph\u01b0\u01a1ng ph\u00e1p kh\u00e1c t\u1ed1t h\u01a1n trong vi\u1ec7c d\u1ef1 \u0111o\u00e1n hi\u1ec7u su\u1ea5t m\u00f4 h\u00ecnh.<\/p>\n<p>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p l\u1ecdc ph\u1ed5 bi\u1ebfn c\u00f3 s\u1eb5n trong c\u00e1c th\u01b0 vi\u1ec7n h\u1ecdc m\u00e1y nh\u01b0 Scikit-Learn (Sklearn) bao g\u1ed3m:<\/p>\n<ul>\n<li><strong>T\u0103ng th\u00f4ng tin (Information gain)<\/strong>: \u0110o l\u01b0\u1eddng m\u1ee9c \u0111\u1ed9 quan tr\u1ecdng c\u1ee7a s\u1ef1 c\u00f3 m\u1eb7t ho\u1eb7c v\u1eafng m\u1eb7t c\u1ee7a m\u1ed9t \u0111\u1eb7c tr\u01b0ng trong vi\u1ec7c x\u00e1c \u0111\u1ecbnh bi\u1ebfn m\u1ee5c ti\u00eau b\u1eb1ng c\u00e1ch gi\u1ea3m m\u1ee9c \u0111\u1ed9 entropy.<\/li>\n<li><strong>Th\u00f4ng tin t\u01b0\u01a1ng h\u1ed7 (Mutual information)<\/strong>: \u0110\u00e1nh gi\u00e1 s\u1ef1 ph\u1ee5 thu\u1ed9c gi\u1eefa c\u00e1c bi\u1ebfn b\u1eb1ng c\u00e1ch \u0111o l\u01b0\u1eddng th\u00f4ng tin thu \u0111\u01b0\u1ee3c v\u1ec1 m\u1ed9t bi\u1ebfn t\u1eeb m\u1ed9t bi\u1ebfn kh\u00e1c.<\/li>\n<li><strong>Ki\u1ec3m \u0111\u1ecbnh Chi-square (Chi-square test)<\/strong>: \u0110\u00e1nh gi\u00e1 m\u1ed1i quan h\u1ec7 gi\u1eefa hai bi\u1ebfn ph\u00e2n lo\u1ea1i b\u1eb1ng c\u00e1ch so s\u00e1nh gi\u00e1 tr\u1ecb quan s\u00e1t \u0111\u01b0\u1ee3c v\u1edbi gi\u00e1 tr\u1ecb k\u1ef3 v\u1ecdng.<\/li>\n<li><strong>\u0110i\u1ec3m Fisher (Fisher\u2019s score)<\/strong>: S\u1eed d\u1ee5ng \u0111\u1ea1o h\u00e0m \u0111\u1ec3 t\u00ednh to\u00e1n m\u1ee9c \u0111\u1ed9 quan tr\u1ecdng t\u01b0\u01a1ng \u0111\u1ed1i c\u1ee7a m\u1ed7i \u0111\u1eb7c tr\u01b0ng trong vi\u1ec7c ph\u00e2n lo\u1ea1i d\u1eef li\u1ec7u. \u0110i\u1ec3m cao h\u01a1n ch\u1ec9 ra \u1ea3nh h\u01b0\u1edfng l\u1edbn h\u01a1n.<\/li>\n<li><strong>H\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan Pearson (Pearson\u2019s correlation coefficient)<\/strong>: \u0110o l\u01b0\u1eddng m\u1ed1i quan h\u1ec7 gi\u1eefa hai bi\u1ebfn li\u00ean t\u1ee5c v\u1edbi \u0111i\u1ec3m s\u1ed1 t\u1eeb -1 \u0111\u1ebfn 1.<\/li>\n<li><strong>Ng\u01b0\u1ee1ng ph\u01b0\u01a1ng sai (Variance threshold)<\/strong>: Lo\u1ea1i b\u1ecf t\u1ea5t c\u1ea3 c\u00e1c \u0111\u1eb7c tr\u01b0ng c\u00f3 ph\u01b0\u01a1ng sai d\u01b0\u1edbi m\u1ed9t m\u1ee9c t\u1ed1i thi\u1ec3u v\u00ec c\u00e1c \u0111\u1eb7c tr\u01b0ng c\u00f3 ph\u01b0\u01a1ng sai l\u1edbn h\u01a1n c\u00f3 kh\u1ea3 n\u0103ng ch\u1ee9a nhi\u1ec1u th\u00f4ng tin h\u1eefu \u00edch.<\/li>\n<li><strong>T\u1ef7 l\u1ec7 gi\u00e1 tr\u1ecb thi\u1ebfu (Missing value ratio)<\/strong>: T\u00ednh to\u00e1n t\u1ef7 l\u1ec7 ph\u1ea7n tr\u0103m c\u00e1c tr\u01b0\u1eddng h\u1ee3p trong b\u1ed9 d\u1eef li\u1ec7u m\u00e0 m\u1ed9t \u0111\u1eb7c tr\u01b0ng nh\u1ea5t \u0111\u1ecbnh b\u1ecb thi\u1ebfu ho\u1eb7c c\u00f3 gi\u00e1 tr\u1ecb null.<\/li>\n<li><strong>T\u1ef7 l\u1ec7 ph\u00e2n t\u00e1n (Dispersion ratio)<\/strong>: T\u1ef7 l\u1ec7 gi\u1eefa ph\u01b0\u01a1ng sai v\u00e0 gi\u00e1 tr\u1ecb trung b\u00ecnh cho m\u1ed9t \u0111\u1eb7c tr\u01b0ng. S\u1ef1 ph\u00e2n t\u00e1n cao h\u01a1n cho th\u1ea5y nhi\u1ec1u th\u00f4ng tin h\u01a1n.<\/li>\n<li><strong>ANOVA (Analysis of Variance)<\/strong>: X\u00e1c \u0111\u1ecbnh li\u1ec7u c\u00e1c gi\u00e1 tr\u1ecb kh\u00e1c nhau c\u1ee7a \u0111\u1eb7c tr\u01b0ng c\u00f3 \u1ea3nh h\u01b0\u1edfng \u0111\u1ebfn gi\u00e1 tr\u1ecb c\u1ee7a bi\u1ebfn m\u1ee5c ti\u00eau hay kh\u00f4ng.<\/li>\n<\/ul>\n<h3><span class=\"ez-toc-section\" id=\"Phuong-phap-bao-boc\"><\/span><strong>Ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc hu\u1ea5n luy\u1ec7n thu\u1eadt to\u00e1n h\u1ecdc m\u00e1y v\u1edbi c\u00e1c t\u1eadp h\u1ee3p con kh\u00e1c nhau c\u1ee7a c\u00e1c \u0111\u1eb7c tr\u01b0ng, th\u00eam ho\u1eb7c lo\u1ea1i b\u1ecf c\u00e1c \u0111\u1eb7c tr\u01b0ng v\u00e0 ki\u1ec3m tra k\u1ebft qu\u1ea3 t\u1ea1i m\u1ed7i <a href=\"https:\/\/interdata.vn\/blog\/vong-lap-la-gi\/\">v\u00f2ng l\u1eb7p<\/a>. M\u1ee5c ti\u00eau c\u1ee7a t\u1ea5t c\u1ea3 c\u00e1c ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc l\u00e0 t\u00ecm ra b\u1ed9 \u0111\u1eb7c tr\u01b0ng gi\u00fap t\u1ed1i \u01b0u h\u00f3a hi\u1ec7u su\u1ea5t m\u00f4 h\u00ecnh.<\/p>\n<p>Ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc ki\u1ec3m tra t\u1ea5t c\u1ea3 c\u00e1c k\u1ebft h\u1ee3p \u0111\u1eb7c tr\u01b0ng c\u00f3 th\u1ec3 c\u00f3 \u0111\u01b0\u1ee3c g\u1ecdi l\u00e0 thu\u1eadt to\u00e1n tham lam (greedy algorithms). Vi\u1ec7c t\u00ecm ki\u1ebfm b\u1ed9 \u0111\u1eb7c tr\u01b0ng t\u1ed1t nh\u1ea5t to\u00e0n di\u1ec7n n\u00e0y y\u00eau c\u1ea7u t\u00ednh to\u00e1n t\u1ed1n k\u00e9m v\u00e0 m\u1ea5t th\u1eddi gian, v\u00ec v\u1eady n\u00f3 th\u00edch h\u1ee3p nh\u1ea5t v\u1edbi c\u00e1c b\u1ed9 d\u1eef li\u1ec7u c\u00f3 kh\u00f4ng gian \u0111\u1eb7c tr\u01b0ng nh\u1ecf h\u01a1n.<\/p>\n<p>C\u00e1c nh\u00e0 khoa h\u1ecdc d\u1eef li\u1ec7u c\u00f3 th\u1ec3 c\u00e0i \u0111\u1eb7t thu\u1eadt to\u00e1n \u0111\u1ec3 d\u1eebng l\u1ea1i khi hi\u1ec7u su\u1ea5t m\u00f4 h\u00ecnh gi\u1ea3m ho\u1eb7c khi s\u1ed1 l\u01b0\u1ee3ng m\u1ee5c ti\u00eau c\u1ee7a c\u00e1c \u0111\u1eb7c tr\u01b0ng \u0111\u01b0\u1ee3c \u0111\u01b0a v\u00e0o.<\/p>\n<p>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p bao b\u1ecdc bao g\u1ed3m:<\/p>\n<ul>\n<li><strong>L\u1ef1a ch\u1ecdn ti\u1ebfn (Forward selection)<\/strong>: B\u1eaft \u0111\u1ea7u v\u1edbi m\u1ed9t b\u1ed9 \u0111\u1eb7c tr\u01b0ng r\u1ed7ng v\u00e0 d\u1ea7n d\u1ea7n th\u00eam c\u00e1c \u0111\u1eb7c tr\u01b0ng m\u1edbi cho \u0111\u1ebfn khi t\u00ecm ra b\u1ed9 \u0111\u1eb7c tr\u01b0ng t\u1ed1i \u01b0u. L\u1ef1a ch\u1ecdn m\u00f4 h\u00ecnh di\u1ec5n ra khi hi\u1ec7u su\u1ea5t c\u1ee7a thu\u1eadt to\u00e1n kh\u00f4ng c\u1ea3i thi\u1ec7n sau b\u1ea5t k\u1ef3 v\u00f2ng l\u1eb7p c\u1ee5 th\u1ec3 n\u00e0o.<\/li>\n<li><strong>L\u1ef1a ch\u1ecdn l\u00f9i (Backward selection)<\/strong>: Hu\u1ea5n luy\u1ec7n m\u00f4 h\u00ecnh v\u1edbi t\u1ea5t c\u1ea3 c\u00e1c \u0111\u1eb7c tr\u01b0ng ban \u0111\u1ea7u v\u00e0 lo\u1ea1i b\u1ecf d\u1ea7n c\u00e1c \u0111\u1eb7c tr\u01b0ng \u00edt quan tr\u1ecdng nh\u1ea5t kh\u1ecfi b\u1ed9 \u0111\u1eb7c tr\u01b0ng.<\/li>\n<li><strong>L\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng to\u00e0n di\u1ec7n (Exhaustive feature selection)<\/strong>: Ki\u1ec3m tra m\u1ecdi k\u1ebft h\u1ee3p c\u00f3 th\u1ec3 c\u1ee7a c\u00e1c \u0111\u1eb7c tr\u01b0ng \u0111\u1ec3 t\u00ecm ra b\u1ed9 \u0111\u1eb7c tr\u01b0ng t\u1ed1i \u01b0u b\u1eb1ng c\u00e1ch t\u1ed1i \u01b0u h\u00f3a m\u1ed9t ch\u1ec9 s\u1ed1 hi\u1ec7u su\u1ea5t \u0111\u00e3 \u0111\u1ecbnh.<\/li>\n<li><strong>Lo\u1ea1i b\u1ecf \u0111\u1eb7c tr\u01b0ng <a href=\"https:\/\/interdata.vn\/blog\/de-quy-la-gi\/\">\u0111\u1ec7 quy<\/a> (Recursive Feature Elimination &#8211; RFE)<\/strong>: M\u1ed9t lo\u1ea1i l\u1ef1a ch\u1ecdn l\u00f9i b\u1eaft \u0111\u1ea7u v\u1edbi kh\u00f4ng gian \u0111\u1eb7c tr\u01b0ng ban \u0111\u1ea7u v\u00e0 lo\u1ea1i b\u1ecf ho\u1eb7c th\u00eam c\u00e1c \u0111\u1eb7c tr\u01b0ng sau m\u1ed7i v\u00f2ng l\u1eb7p d\u1ef1a tr\u00ean m\u1ee9c \u0111\u1ed9 quan tr\u1ecdng t\u01b0\u01a1ng \u0111\u1ed1i c\u1ee7a ch\u00fang.<\/li>\n<li><strong>Lo\u1ea1i b\u1ecf \u0111\u1eb7c tr\u01b0ng \u0111\u1ec7 quy v\u1edbi ki\u1ec3m tra ch\u00e9o (Recursive Feature Elimination with <a href=\"https:\/\/interdata.vn\/blog\/cross-validation-la-gi\/\">Cross-Validation<\/a> &#8211; RFE-CV)<\/strong>: M\u1ed9t bi\u1ebfn th\u1ec3 c\u1ee7a lo\u1ea1i b\u1ecf \u0111\u1ec7 quy s\u1eed d\u1ee5ng ki\u1ec3m tra ch\u00e9o, th\u1eed nghi\u1ec7m m\u00f4 h\u00ecnh tr\u00ean d\u1eef li\u1ec7u ch\u01b0a th\u1ea5y, \u0111\u1ec3 ch\u1ecdn b\u1ed9 \u0111\u1eb7c tr\u01b0ng c\u00f3 hi\u1ec7u su\u1ea5t t\u1ed1t nh\u1ea5t.<\/li>\n<\/ul>\n<h3><span class=\"ez-toc-section\" id=\"Phuong-phap-nhung\"><\/span><strong>Ph\u01b0\u01a1ng ph\u00e1p nh\u00fang<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Ph\u01b0\u01a1ng ph\u00e1p nh\u00fang t\u00edch h\u1ee3p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng v\u00e0o qu\u00e1 tr\u00ecnh hu\u1ea5n luy\u1ec7n m\u00f4 h\u00ecnh. Khi m\u00f4 h\u00ecnh tr\u1ea3i qua qu\u00e1 tr\u00ecnh hu\u1ea5n luy\u1ec7n, n\u00f3 s\u1eed d\u1ee5ng c\u00e1c c\u01a1 ch\u1ebf kh\u00e1c nhau \u0111\u1ec3 ph\u00e1t hi\u1ec7n c\u00e1c \u0111\u1eb7c tr\u01b0ng ho\u1ea1t \u0111\u1ed9ng k\u00e9m v\u00e0 lo\u1ea1i b\u1ecf ch\u00fang kh\u1ecfi c\u00e1c v\u00f2ng l\u1eb7p ti\u1ebfp theo.<\/p>\n<p>Nhi\u1ec1u ph\u01b0\u01a1ng ph\u00e1p nh\u00fang xoay quanh vi\u1ec7c \u0111i\u1ec1u chu\u1ea9n (regularization), h\u00ecnh ph\u1ea1t \u0111\u1ed1i v\u1edbi c\u00e1c \u0111\u1eb7c tr\u01b0ng d\u1ef1a tr\u00ean ng\u01b0\u1ee1ng h\u1ec7 s\u1ed1 \u0111\u00e3 \u0111\u1ecbnh. C\u00e1c m\u00f4 h\u00ecnh \u0111\u00e1nh \u0111\u1ed5i \u0111\u1ed9 ch\u00ednh x\u00e1c \u0111\u1ec3 \u0111\u1ea1t \u0111\u01b0\u1ee3c \u0111\u1ed9 ch\u00ednh x\u00e1c cao h\u01a1n. K\u1ebft qu\u1ea3 l\u00e0 c\u00e1c m\u00f4 h\u00ecnh ho\u1ea1t \u0111\u1ed9ng h\u01a1i k\u00e9m h\u01a1n trong hu\u1ea5n luy\u1ec7n, nh\u01b0ng tr\u1edf n\u00ean t\u1ed5ng qu\u00e1t h\u01a1n b\u1eb1ng c\u00e1ch gi\u1ea3m hi\u1ec7n t\u01b0\u1ee3ng overfitting.<\/p>\n<p>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p nh\u00fang bao g\u1ed3m:<\/p>\n<ul>\n<li><strong>H\u1ed3i quy LASSO (L1 regression)<\/strong>: Th\u00eam m\u1ed9t h\u00ecnh ph\u1ea1t v\u00e0o h\u00e0m m\u1ea5t m\u00e1t \u0111\u1ed1i v\u1edbi c\u00e1c h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan c\u00f3 gi\u00e1 tr\u1ecb cao, \u0111\u1ea9y ch\u00fang v\u1ec1 gi\u00e1 tr\u1ecb 0. C\u00e1c h\u1ec7 s\u1ed1 c\u00f3 gi\u00e1 tr\u1ecb 0 s\u1ebd b\u1ecb lo\u1ea1i b\u1ecf.<\/li>\n<li><strong>T\u00ednh quan tr\u1ecdng c\u1ee7a r\u1eebng ng\u1eabu nhi\u00ean (Random forest importance)<\/strong>: X\u00e2y d\u1ef1ng h\u00e0ng tr\u0103m c\u00e2y quy\u1ebft \u0111\u1ecbnh, m\u1ed7i c\u00e2y s\u1eed d\u1ee5ng m\u1ed9t t\u1eadp h\u1ee3p ng\u1eabu nhi\u00ean c\u00e1c \u0111i\u1ec3m d\u1eef li\u1ec7u v\u00e0 \u0111\u1eb7c tr\u01b0ng.<\/li>\n<li><strong>T\u0103ng c\u01b0\u1eddng gradient (Gradient boosting)<\/strong>: Th\u00eam c\u00e1c tr\u00ecnh d\u1ef1 \u0111o\u00e1n v\u00e0o m\u1ed9t b\u1ed9 h\u1ee3p nh\u1ea5t, m\u1ed7i v\u00f2ng l\u1eb7p s\u1eeda ch\u1eefa c\u00e1c l\u1ed7i c\u1ee7a v\u00f2ng tr\u01b0\u1edbc.<\/li>\n<\/ul>\n<figure id=\"attachment_27111\" aria-describedby=\"caption-attachment-27111\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Cac-phuong-phap-Feature-Selection-co-ban.png\" alt=\"C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection c\u01a1 b\u1ea3n\" width=\"800\" height=\"500\" class=\"size-full wp-image-27111\" title=\"\" srcset=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Cac-phuong-phap-Feature-Selection-co-ban.png 800w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Cac-phuong-phap-Feature-Selection-co-ban-300x188.png 300w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Cac-phuong-phap-Feature-Selection-co-ban-768x480.png 768w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Cac-phuong-phap-Feature-Selection-co-ban-750x469.png 750w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-27111\" class=\"wp-caption-text\">C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection c\u01a1 b\u1ea3n<\/figcaption><\/figure>\n<h2><span class=\"ez-toc-section\" id=\"Cac-phuong-phap-Feature-Selection-khong-giam-sat\"><\/span><strong>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p Feature Selection kh\u00f4ng gi\u00e1m s\u00e1t<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>V\u1edbi h\u1ecdc kh\u00f4ng gi\u00e1m s\u00e1t, c\u00e1c m\u00f4 h\u00ecnh t\u1ef1 ph\u00e1t hi\u1ec7n c\u00e1c \u0111\u1eb7c tr\u01b0ng, m\u1eabu v\u00e0 m\u1ed1i quan h\u1ec7 trong d\u1eef li\u1ec7u. Kh\u00f4ng th\u1ec3 \u0111i\u1ec1u ch\u1ec9nh c\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o cho m\u1ed9t bi\u1ebfn m\u1ee5c ti\u00eau \u0111\u00e3 bi\u1ebft. C\u00e1c ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng kh\u00f4ng gi\u00e1m s\u00e1t s\u1eed d\u1ee5ng c\u00e1c k\u1ef9 thu\u1eadt kh\u00e1c \u0111\u1ec3 \u0111\u01a1n gi\u1ea3n h\u00f3a v\u00e0 tinh ch\u1ec9nh kh\u00f4ng gian \u0111\u1eb7c tr\u01b0ng.<\/p>\n<p>M\u1ed9t ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng kh\u00f4ng gi\u00e1m s\u00e1t l\u00e0 ph\u00e2n t\u00edch th\u00e0nh ph\u1ea7n ch\u00ednh (PCA). PCA gi\u1ea3m \u0111\u1ed9 ph\u1ee9c t\u1ea1p c\u1ee7a c\u00e1c b\u1ed9 d\u1eef li\u1ec7u l\u1edbn b\u1eb1ng c\u00e1ch bi\u1ebfn \u0111\u1ed5i c\u00e1c bi\u1ebfn c\u00f3 th\u1ec3 c\u00f3 t\u01b0\u01a1ng quan th\u00e0nh m\u1ed9t b\u1ed9 c\u00e1c bi\u1ebfn nh\u1ecf h\u01a1n. C\u00e1c th\u00e0nh ph\u1ea7n ch\u00ednh n\u00e0y gi\u1eef l\u1ea1i h\u1ea7u h\u1ebft th\u00f4ng tin ch\u1ee9a trong b\u1ed9 d\u1eef li\u1ec7u ban \u0111\u1ea7u. PCA gi\u00fap ch\u1ed1ng l\u1ea1i hi\u1ec7n t\u01b0\u1ee3ng &#8220;l\u1eddi nguy\u1ec1n chi\u1ec1u kh\u00f4ng gian&#8221; v\u00e0 gi\u1ea3m overfitting.<\/p>\n<p>C\u00e1c ph\u01b0\u01a1ng ph\u00e1p kh\u00e1c bao g\u1ed3m ph\u00e2n t\u00edch th\u00e0nh ph\u1ea7n \u0111\u1ed9c l\u1eadp (ICA) v\u00e0 m\u00e3 h\u00f3a t\u1ef1 \u0111\u1ed9ng (autoencoders).<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Loi-ich-khi-su-dung-Feature-Selection\"><\/span><strong>L\u1ee3i \u00edch khi s\u1eed d\u1ee5ng Feature Selection<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>S\u1eed d\u1ee5ng Feature Selection (L\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng) trong quy tr\u00ecnh h\u1ecdc m\u00e1y mang l\u1ea1i nhi\u1ec1u l\u1ee3i \u00edch thi\u1ebft th\u1ef1c. C\u00e1c \u01b0u \u0111i\u1ec3m ch\u00ednh bao g\u1ed3m vi\u1ec7c \u0111\u01a1n gi\u1ea3n h\u00f3a m\u00f4 h\u00ecnh, t\u0103ng kh\u1ea3 n\u0103ng di\u1ec5n gi\u1ea3i, c\u1ea3i thi\u1ec7n hi\u1ec7u su\u1ea5t d\u1ef1 \u0111o\u00e1n, gi\u1ea3m \u0111\u00e1ng k\u1ec3 th\u1eddi gian hu\u1ea5n luy\u1ec7n v\u00e0 gi\u1ea3m nguy c\u01a1 qu\u00e1 kh\u1edbp (overfitting).<\/p>\n<h3><span class=\"ez-toc-section\" id=\"Don-gian-hoa-mo-hinh-va-tang-kha-nang-dien-giai\"><\/span><strong>\u0110\u01a1n gi\u1ea3n h\u00f3a m\u00f4 h\u00ecnh v\u00e0 t\u0103ng kh\u1ea3 n\u0103ng di\u1ec5n gi\u1ea3i<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Vi\u1ec7c <strong>lo\u1ea1i b\u1ecf c\u00e1c \u0111\u1eb7c tr\u01b0ng kh\u00f4ng c\u1ea7n thi\u1ebft<\/strong> ho\u1eb7c d\u01b0 th\u1eeba gi\u00fap c\u1ea5u tr\u00fac m\u00f4 h\u00ecnh tr\u1edf n\u00ean g\u1ecdn g\u00e0ng v\u00e0 \u0111\u01a1n gi\u1ea3n h\u01a1n. M\u00f4 h\u00ecnh \u0111\u01a1n gi\u1ea3n th\u01b0\u1eddng d\u1ec5 tri\u1ec3n khai, c\u1eadp nh\u1eadt v\u00e0 g\u1ee1 l\u1ed7i h\u01a1n trong c\u00e1c h\u1ec7 th\u1ed1ng th\u1ef1c t\u1ebf, gi\u1ea3m chi ph\u00ed b\u1ea3o tr\u00ec.<\/p>\n<p>V\u1edbi m\u1ed9t s\u1ed1 l\u01b0\u1ee3ng \u00edt \u0111\u1eb7c tr\u01b0ng h\u01a1n, \u0111\u1eb7c bi\u1ec7t l\u00e0 nh\u1eefng \u0111\u1eb7c tr\u01b0ng c\u00f3 \u00fd ngh\u0129a nghi\u1ec7p v\u1ee5 r\u00f5 r\u00e0ng, ch\u00fang ta c\u00f3 th\u1ec3 d\u1ec5 d\u00e0ng hi\u1ec3u \u0111\u01b0\u1ee3c c\u00e1ch m\u00f4 h\u00ecnh \u0111\u01b0a ra quy\u1ebft \u0111\u1ecbnh. Kh\u1ea3 n\u0103ng di\u1ec5n gi\u1ea3i (interpretability) n\u00e0y gi\u00fap t\u0103ng c\u01b0\u1eddng s\u1ef1 tin t\u01b0\u1edfng v\u00e0o m\u00f4 h\u00ecnh t\u1eeb ph\u00eda ng\u01b0\u1eddi d\u00f9ng v\u00e0 c\u00e1c b\u00ean li\u00ean quan.<\/p>\n<figure id=\"attachment_27110\" aria-describedby=\"caption-attachment-27110\" style=\"width: 800px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Loi-ich-khi-su-dung-Feature-Selection.jpg\" alt=\"L\u1ee3i \u00edch khi s\u1eed d\u1ee5ng Feature Selection\" width=\"800\" height=\"500\" class=\"size-full wp-image-27110\" title=\"\" srcset=\"https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Loi-ich-khi-su-dung-Feature-Selection.jpg 800w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Loi-ich-khi-su-dung-Feature-Selection-300x188.jpg 300w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Loi-ich-khi-su-dung-Feature-Selection-768x480.jpg 768w, https:\/\/interdata.vn\/blog\/wp-content\/uploads\/2025\/04\/Loi-ich-khi-su-dung-Feature-Selection-750x469.jpg 750w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><figcaption id=\"caption-attachment-27110\" class=\"wp-caption-text\">L\u1ee3i \u00edch khi s\u1eed d\u1ee5ng Feature Selection<\/figcaption><\/figure>\n<h3><span class=\"ez-toc-section\" id=\"Cai-thien-hieu-suat-du-doan\"><\/span><strong>C\u1ea3i thi\u1ec7n hi\u1ec7u su\u1ea5t d\u1ef1 \u0111o\u00e1n<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p><strong>Lo\u1ea1i b\u1ecf c\u00e1c \u0111\u1eb7c tr\u01b0ng g\u00e2y nhi\u1ec5u<\/strong>, kh\u00f4ng li\u00ean quan \u0111\u1ebfn bi\u1ebfn m\u1ee5c ti\u00eau gi\u00fap m\u00f4 h\u00ecnh t\u1eadp trung v\u00e0o c\u00e1c t\u00edn hi\u1ec7u th\u1ef1c s\u1ef1 quan tr\u1ecdng trong d\u1eef li\u1ec7u. \u0110i\u1ec1u n\u00e0y th\u01b0\u1eddng d\u1eabn \u0111\u1ebfn vi\u1ec7c c\u1ea3i thi\u1ec7n \u0111\u1ed9 ch\u00ednh x\u00e1c (accuracy) v\u00e0 c\u00e1c ch\u1ec9 s\u1ed1 \u0111\u00e1nh gi\u00e1 hi\u1ec7u su\u1ea5t kh\u00e1c c\u1ee7a m\u00f4 h\u00ecnh.<\/p>\n<p>C\u00e1c \u0111\u1eb7c tr\u01b0ng d\u01b0 th\u1eeba (redundant features) &#8211; nh\u1eefng \u0111\u1eb7c tr\u01b0ng cung c\u1ea5p th\u00f4ng tin t\u01b0\u01a1ng t\u1ef1 nhau &#8211; c\u0169ng c\u00f3 th\u1ec3 b\u1ecb lo\u1ea1i b\u1ecf. Vi\u1ec7c n\u00e0y gi\u00fap m\u00f4 h\u00ecnh ho\u1ea1t \u0111\u1ed9ng \u1ed5n \u0111\u1ecbnh h\u01a1n v\u00e0 tr\u00e1nh ph\u1ee5 thu\u1ed9c qu\u00e1 nhi\u1ec1u v\u00e0o m\u1ed9t nh\u00f3m th\u00f4ng tin c\u1ee5 th\u1ec3 n\u00e0o \u0111\u00f3 trong d\u1eef li\u1ec7u.<\/p>\n<h3><span class=\"ez-toc-section\" id=\"Giam-thoi-gian-huan-luyen-va-chi-phi-tinh-toan\"><\/span><strong>Gi\u1ea3m th\u1eddi gian hu\u1ea5n luy\u1ec7n v\u00e0 chi ph\u00ed t\u00ednh to\u00e1n<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Hu\u1ea5n luy\u1ec7n m\u1ed9t m\u00f4 h\u00ecnh Machine Learning tr\u00ean t\u1eadp d\u1eef li\u1ec7u c\u00f3 s\u1ed1 l\u01b0\u1ee3ng \u0111\u1eb7c tr\u01b0ng \u00edt h\u01a1n s\u1ebd y\u00eau c\u1ea7u \u00edt ph\u00e9p t\u00ednh to\u00e1n h\u01a1n. \u0110i\u1ec1u n\u00e0y l\u00e0m gi\u1ea3m \u0111\u00e1ng k\u1ec3 th\u1eddi gian c\u1ea7n thi\u1ebft \u0111\u1ec3 hu\u1ea5n luy\u1ec7n m\u00f4 h\u00ecnh, gi\u00fap \u0111\u1ea9y nhanh qu\u00e1 tr\u00ecnh th\u1eed nghi\u1ec7m v\u00e0 ph\u00e1t tri\u1ec3n.<\/p>\n<p>Vi\u1ec7c gi\u1ea3m s\u1ed1 chi\u1ec1u d\u1eef li\u1ec7u c\u0169ng gi\u00fap ti\u1ebft ki\u1ec7m t\u00e0i nguy\u00ean ph\u1ea7n c\u1ee9ng nh\u01b0 b\u1ed9 nh\u1edb <a href=\"https:\/\/interdata.vn\/blog\/ram-server\/\">RAM<\/a> v\u00e0 dung l\u01b0\u1ee3ng l\u01b0u tr\u1eef. \u01afu \u0111i\u1ec3m n\u00e0y tr\u1edf n\u00ean \u0111\u1eb7c bi\u1ec7t quan tr\u1ecdng khi ph\u1ea3i x\u1eed l\u00fd c\u00e1c t\u1eadp d\u1eef li\u1ec7u c\u00f3 quy m\u00f4 r\u1ea5t l\u1edbn (Big Data).<\/p>\n<h3><span class=\"ez-toc-section\" id=\"Giam-nguy-co-qua-khop-Overfitting\"><\/span><strong>Gi\u1ea3m nguy c\u01a1 qu\u00e1 kh\u1edbp (Overfitting)<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p>Hi\u1ec7n t\u01b0\u1ee3ng qu\u00e1 kh\u1edbp (Overfitting) x\u1ea3y ra khi m\u00f4 h\u00ecnh qu\u00e1 ph\u1ee9c t\u1ea1p v\u00e0 h\u1ecdc thu\u1ed9c l\u00f2ng c\u1ea3 nhi\u1ec5u trong d\u1eef li\u1ec7u hu\u1ea5n luy\u1ec7n, d\u1eabn \u0111\u1ebfn kh\u1ea3 n\u0103ng d\u1ef1 \u0111o\u00e1n k\u00e9m tr\u00ean d\u1eef li\u1ec7u m\u1edbi. Feature Selection gi\u00fap gi\u1ea3m \u0111\u1ed9 ph\u1ee9c t\u1ea1p c\u1ee7a m\u00f4 h\u00ecnh b\u1eb1ng c\u00e1ch gi\u1ea3m s\u1ed1 l\u01b0\u1ee3ng <a href=\"https:\/\/interdata.vn\/blog\/tham-so-parameter-la-gi\/\">tham s\u1ed1<\/a>.<\/p>\n<p>M\u00f4 h\u00ecnh \u0111\u01a1n gi\u1ea3n h\u01a1n v\u1edbi \u00edt \u0111\u1eb7c tr\u01b0ng h\u01a1n th\u01b0\u1eddng c\u00f3 <strong>kh\u1ea3 n\u0103ng kh\u00e1i qu\u00e1t h\u00f3a t\u1ed1t h\u01a1n<\/strong>, t\u1ee9c l\u00e0 ch\u00fang h\u1ecdc \u0111\u01b0\u1ee3c quy lu\u1eadt t\u1ed5ng qu\u00e1t t\u1eeb d\u1eef li\u1ec7u thay v\u00ec ch\u1ec9 ghi nh\u1edb c\u00e1c \u0111i\u1ec3m d\u1eef li\u1ec7u c\u1ee5 th\u1ec3. \u0110i\u1ec1u n\u00e0y l\u00e0m gi\u1ea3m nguy c\u01a1 overfitting m\u1ed9t c\u00e1ch hi\u1ec7u qu\u1ea3.<\/p>\n<p>\u01afu \u0111i\u1ec3m n\u00e0y \u0111\u1eb7c bi\u1ec7t ph\u00e1t huy t\u00e1c d\u1ee5ng khi l\u00e0m vi\u1ec7c v\u1edbi d\u1eef li\u1ec7u c\u00f3 s\u1ed1 chi\u1ec1u cao (high-dimensional data), n\u01a1i &#8220;l\u1eddi nguy\u1ec1n chi\u1ec1u kh\u00f4ng gian&#8221; (Curse of Dimensionality) l\u00e0m t\u0103ng nguy c\u01a1 overfitting. Feature Selection l\u00e0 m\u1ed9t c\u00f4ng c\u1ee5 h\u1eefu hi\u1ec7u \u0111\u1ec3 \u0111\u1ed1i ph\u00f3 v\u1edbi th\u00e1ch th\u1ee9c n\u00e0y.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Cach-chon-phuong-phap-Feature-Selection\"><\/span><strong>C\u00e1ch ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p Feature Selection<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Vi\u1ec7c ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p Feature Selection t\u1ed1t nh\u1ea5t ph\u1ee5 thu\u1ed9c v\u00e0o \u0111\u1ea7u v\u00e0o v\u00e0 \u0111\u1ea7u ra c\u1ea7n xem x\u00e9t:<\/p>\n<ul>\n<li><strong>\u0110\u1ea7u v\u00e0o s\u1ed1, \u0110\u1ea7u ra s\u1ed1<\/strong>: V\u1ea5n \u0111\u1ec1 l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng trong h\u1ed3i quy v\u1edbi c\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o s\u1ed1 &#8211; s\u1eed d\u1ee5ng h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan, nh\u01b0 h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan Pearson (cho l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng <a href=\"https:\/\/interdata.vn\/blog\/hoi-quy-tuyen-tinh\/\">h\u1ed3i quy tuy\u1ebfn t\u00ednh<\/a>) ho\u1eb7c h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan Spearman (cho h\u1ed3i quy phi tuy\u1ebfn).<\/li>\n<li><strong>\u0110\u1ea7u v\u00e0o s\u1ed1, \u0110\u1ea7u ra ph\u00e2n lo\u1ea1i<\/strong>: V\u1ea5n \u0111\u1ec1 l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng trong ph\u00e2n lo\u1ea1i v\u1edbi c\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o s\u1ed1 &#8211; s\u1eed d\u1ee5ng h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan, xem x\u00e9t m\u1ee5c ti\u00eau ph\u00e2n lo\u1ea1i, nh\u01b0 h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan ANOVA (cho tuy\u1ebfn t\u00ednh) ho\u1eb7c h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan Kendall (cho phi tuy\u1ebfn).<\/li>\n<li><strong>\u0110\u1ea7u v\u00e0o ph\u00e2n lo\u1ea1i, \u0110\u1ea7u ra s\u1ed1<\/strong>: V\u1ea5n \u0111\u1ec1 m\u00f4 h\u00ecnh h\u1ed3i quy v\u1edbi c\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o ph\u00e2n lo\u1ea1i (hi\u1ebfm g\u1eb7p) &#8211; s\u1eed d\u1ee5ng h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan, nh\u01b0 h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan ANOVA (cho tuy\u1ebfn t\u00ednh) ho\u1eb7c h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan Kendall (cho phi tuy\u1ebfn), nh\u01b0ng theo chi\u1ec1u ng\u01b0\u1ee3c l\u1ea1i.<\/li>\n<li><strong>\u0110\u1ea7u v\u00e0o ph\u00e2n lo\u1ea1i, \u0110\u1ea7u ra ph\u00e2n lo\u1ea1i<\/strong>: V\u1ea5n \u0111\u1ec1 m\u00f4 h\u00ecnh ph\u00e2n lo\u1ea1i v\u1edbi c\u00e1c bi\u1ebfn \u0111\u1ea7u v\u00e0o ph\u00e2n lo\u1ea1i &#8211; s\u1eed d\u1ee5ng h\u1ec7 s\u1ed1 t\u01b0\u01a1ng quan, nh\u01b0 ki\u1ec3m \u0111\u1ecbnh Chi-Squared (b\u1ea3ng ph\u1ee5 thu\u1ed9c) ho\u1eb7c Th\u00f4ng tin t\u01b0\u01a1ng h\u1ed7, \u0111\u00e2y l\u00e0 ph\u01b0\u01a1ng ph\u00e1p m\u1ea1nh m\u1ebd kh\u00f4ng ph\u1ee5 thu\u1ed9c v\u00e0o <a href=\"https:\/\/interdata.vn\/blog\/kieu-du-lieu-data-type\/\">ki\u1ec3u d\u1eef li\u1ec7u<\/a>.<\/li>\n<\/ul>\n<p>Feature Selection kh\u00f4ng ch\u1ec9 gi\u00fap t\u1ed1i \u01b0u h\u00f3a hi\u1ec7u su\u1ea5t m\u00f4 h\u00ecnh m\u00e0 c\u00f2n mang l\u1ea1i nhi\u1ec1u l\u1ee3i \u00edch quan tr\u1ecdng nh\u01b0 gi\u1ea3m th\u1eddi gian hu\u1ea5n luy\u1ec7n, gi\u1ea3m nguy c\u01a1 qu\u00e1 kh\u1edbp v\u00e0 t\u0103ng kh\u1ea3 n\u0103ng di\u1ec5n gi\u1ea3i c\u1ee7a m\u00f4 h\u00ecnh.<\/p>\n<p>Vi\u1ec7c hi\u1ec3u \u0111\u01b0\u1ee3c ph\u01b0\u01a1ng ph\u00e1p Feature Selection l\u00e0 g\u00ec v\u00e0 l\u1ef1a ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p ph\u00f9 h\u1ee3p s\u1ebd ph\u1ee5 thu\u1ed9c v\u00e0o \u0111\u1eb7c \u0111i\u1ec3m c\u1ee7a d\u1eef li\u1ec7u v\u00e0 m\u1ee5c ti\u00eau c\u1ee7a b\u00e0i to\u00e1n. N\u1ebfu bi\u1ebft c\u00e1ch \u00e1p d\u1ee5ng \u0111\u00fang c\u00e1c ph\u01b0\u01a1ng ph\u00e1p l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng, b\u1ea1n s\u1ebd c\u00f3 th\u1ec3 c\u1ea3i thi\u1ec7n \u0111\u00e1ng k\u1ec3 hi\u1ec7u qu\u1ea3 c\u1ee7a m\u00f4 h\u00ecnh Machine Learning, \u0111\u1eb7c bi\u1ec7t khi l\u00e0m vi\u1ec7c v\u1edbi c\u00e1c b\u1ed9 d\u1eef li\u1ec7u ph\u1ee9c t\u1ea1p ho\u1eb7c c\u00f3 chi\u1ec1u cao.<\/p>\n<p>Khi tri\u1ec3n khai c\u00e1c m\u00f4 h\u00ecnh h\u1ecdc m\u00e1y, vi\u1ec7c s\u1eed d\u1ee5ng m\u1ed9t m\u00f4i tr\u01b0\u1eddng \u1ed5n \u0111\u1ecbnh v\u00e0 m\u1ea1nh m\u1ebd l\u00e0 v\u00f4 c\u00f9ng quan tr\u1ecdng. D\u1ecbch v\u1ee5 <a href=\"https:\/\/interdata.vn\/thue-vps\/\">thu\u00ea VPS ch\u1ea5t l\u01b0\u1ee3ng gi\u00e1 r\u1ebb<\/a> t\u1ea1i InterData cung c\u1ea5p ph\u1ea7n c\u1ee9ng th\u1ebf h\u1ec7 m\u1edbi v\u1edbi <a href=\"https:\/\/interdata.vn\/blog\/cpu-server\/\">CPU<\/a> <a href=\"https:\/\/interdata.vn\/blog\/cpu-amd-epyc\/\">AMD EPYC<\/a> v\u00e0 <a href=\"https:\/\/interdata.vn\/blog\/intel-xeon\/\">Intel Xeon<\/a> Platinum, SSD NVMe U.2, gi\u00fap b\u1ea1n x\u1eed l\u00fd d\u1eef li\u1ec7u nhanh ch\u00f3ng v\u00e0 hi\u1ec7u qu\u1ea3 v\u1edbi chi ph\u00ed h\u1ee3p l\u00fd.<\/p>\n<p>N\u1ebfu b\u1ea1n c\u1ea7n m\u1ed9t gi\u1ea3i ph\u00e1p linh ho\u1ea1t v\u00e0 m\u1ea1nh m\u1ebd h\u01a1n, d\u1ecbch v\u1ee5 <a href=\"https:\/\/interdata.vn\/cloud-server\/\">thu\u00ea Cloud Server gi\u00e1 r\u1ebb t\u1ed1c \u0111\u1ed9 cao<\/a> c\u1ee7a InterData l\u00e0 l\u1ef1a ch\u1ecdn l\u00fd t\u01b0\u1edfng. V\u1edbi c\u1ea5u h\u00ecnh t\u1ed1i \u01b0u v\u00e0 b\u0103ng th\u00f4ng cao, d\u1ecbch v\u1ee5 n\u00e0y mang \u0111\u1ebfn hi\u1ec7u su\u1ea5t \u1ed5n \u0111\u1ecbnh cho c\u00e1c d\u1ef1 \u00e1n h\u1ecdc m\u00e1y, gi\u00fap b\u1ea1n t\u1ed1i \u01b0u h\u00f3a th\u1eddi gian v\u00e0 chi ph\u00ed trong qu\u00e1 tr\u00ecnh ph\u00e1t tri\u1ec3n m\u00f4 h\u00ecnh.<\/p>\n<p><strong>INTERDATA<\/strong><\/p>\n<ul>\n<li><strong><a href=\"https:\/\/interdata.vn\/blog\/website-la-gi\/\">Website<\/a>:<\/strong><span>\u00a0<\/span>Interdata.vn<\/li>\n<li><strong>Hotline:<\/strong><span>\u00a0<\/span>1900-636822<\/li>\n<li><strong>Email:<\/strong><span>\u00a0<\/span>Info@interdata.vn<\/li>\n<li><strong>VP\u0110D:<\/strong><span>\u00a0<\/span>240 Nguy\u1ec5n \u0110\u00ecnh Ch\u00ednh, P.11. Q. Ph\u00fa Nhu\u1eadn, TP. Ho\u0302\u0300 Ch\u00ed Minh<\/li>\n<li><strong>VPGD:<\/strong><span>\u00a0<\/span>S\u1ed1 211 \u0110\u01b0\u1eddng s\u1ed1 5, K\u0110T Lakeview City, P. An Ph\u00fa, TP. Th\u1ee7 \u0110\u1ee9c, TP. H\u1ed3 Ch\u00ed Minh<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Trong qu\u00e1 tr\u00ecnh x\u00e2y d\u1ef1ng m\u00f4 h\u00ecnh Machine Learning (H\u1ecdc m\u00e1y), vi\u1ec7c l\u1ef1a ch\u1ecdn \u0111\u1eb7c tr\u01b0ng (Feature Selection) \u0111\u00f3ng vai tr\u00f2 v\u00f4 c\u00f9ng quan tr\u1ecdng gi\u00fap c\u1ea3i thi\u1ec7n hi\u1ec7u su\u1ea5t v\u00e0 \u0111\u1ed9 ch\u00ednh x\u00e1c c\u1ee7a m\u00f4 h\u00ecnh. Feature Selection gi\u00fap gi\u1ea3m b\u1edbt s\u1ed1 l\u01b0\u1ee3ng \u0111\u1eb7c tr\u01b0ng \u0111\u1ea7u v\u00e0o, lo\u1ea1i b\u1ecf nh\u1eefng \u0111\u1eb7c tr\u01b0ng kh\u00f4ng li\u00ean<\/p>\n","protected":false},"author":11,"featured_media":27112,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[108],"tags":[],"class_list":["post-27104","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai"],"_links":{"self":[{"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/posts\/27104","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/comments?post=27104"}],"version-history":[{"count":3,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/posts\/27104\/revisions"}],"predecessor-version":[{"id":27320,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/posts\/27104\/revisions\/27320"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/media\/27112"}],"wp:attachment":[{"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/media?parent=27104"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/categories?post=27104"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/interdata.vn\/blog\/wp-json\/wp\/v2\/tags?post=27104"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}