. "Ti\u1EC1n x\u1EED l\u00FD d\u1EEF li\u1EC7u, x\u1EED l\u00FD s\u01A1 b\u1ED9 v\u0103n b\u1EA3n: x\u00F3a b\u1ECF nh\u1EEFng k\u00ED t\u1EF1, nh\u1EEFng m\u00E3 \u0111i\u1EC1u khi\u1EC3n, nh\u1EEFng v\u00F9ng kh\u00F4ng c\u1EA7n thi\u1EBFt cho h\u1EC7 th\u1ED1ng g\u1ED3m: t\u00E1ch \u0111o\u1EA1n/c\u00E2u/t\u1EEB (paragraph/sentence/word segmentation), l\u00E0m s\u1EA1ch (cleaning), t\u00EDch h\u1EE3p (integreation), chuy\u1EC3n \u0111\u1ED5i (transformation), gi\u1EA3m s\u1ED1 chi\u1EC1u (reduction)."@en . "Ti\u1EC1n x\u1EED l\u00FD d\u1EEF li\u1EC7u, x\u1EED l\u00FD s\u01A1 b\u1ED9 v\u0103n b\u1EA3n: x\u00F3a b\u1ECF nh\u1EEFng k\u00ED t\u1EF1, nh\u1EEFng m\u00E3 \u0111i\u1EC1u khi\u1EC3n, nh\u1EEFng v\u00F9ng kh\u00F4ng c\u1EA7n thi\u1EBFt cho h\u1EC7 th\u1ED1ng g\u1ED3m: t\u00E1ch \u0111o\u1EA1n/c\u00E2u/t\u1EEB (paragraph/sentence/word segmentation), l\u00E0m s\u1EA1ch (cleaning), t\u00EDch h\u1EE3p (integreation), chuy\u1EC3n \u0111\u1ED5i (transformation), gi\u1EA3m s\u1ED1 chi\u1EC1u (reduction)."@en . "Preprocessing"@en .