Source-Changes-HG archive
[pkgsrc/trunk]: pkgsrc/textproc/split-thai Update to 0.6
details: https://anonhg.NetBSD.org/pkgsrc/rev/400e39d5ac25
branches: trunk
changeset: 437269:400e39d5ac25
user: scole <scole%pkgsrc.org@localhost>
date: Wed Aug 19 16:52:25 2020 +0000
description:
Update to 0.6
- handle numbers [0-9] better with emacs splitter
diffstat:
textproc/split-thai/Makefile | 4 ++--
textproc/split-thai/files/thai-utility.el | 2 +-
2 files changed, 3 insertions(+), 3 deletions(-)
diffs (24 lines):
diff -r 9d671af4f3b3 -r 400e39d5ac25 textproc/split-thai/Makefile
--- a/textproc/split-thai/Makefile Wed Aug 19 16:32:57 2020 +0000
+++ b/textproc/split-thai/Makefile Wed Aug 19 16:52:25 2020 +0000
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.5 2020/08/18 17:10:28 scole Exp $
+# $NetBSD: Makefile,v 1.6 2020/08/19 16:52:25 scole Exp $
-PKGNAME= split-thai-0.5
+PKGNAME= split-thai-0.6
CATEGORIES= textproc
MAINTAINER= pkgsrc-users%NetBSD.org@localhost
COMMENT= Utilities to split UTF-8 Thai text into words
diff -r 9d671af4f3b3 -r 400e39d5ac25 textproc/split-thai/files/thai-utility.el
--- a/textproc/split-thai/files/thai-utility.el Wed Aug 19 16:32:57 2020 +0000
+++ b/textproc/split-thai/files/thai-utility.el Wed Aug 19 16:52:25 2020 +0000
@@ -183,7 +183,7 @@
'thai-break-words doesn't always split numbers properly. this may
improve tokenization somewhat."
;; xxx this really should be fixed in 'thai-word lib
- (let* ((num_rexp "\\([\u0e50-\u0e59]+\\)") ;; thai numbers
+ (let* ((num_rexp "\\([0-9\u0e50-\u0e59]+\\)") ;; numbers
(nonnum_rexp "\\([\u0e00-\u0e4f\u0e5a-\u0e7f]\\)") ;; "non-numbers"
(trailing_rexp (concat num_rexp nonnum_rexp))
(leading_rexp (concat nonnum_rexp num_rexp)))
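The effect of widening num_rexp can be sketched outside Emacs. The block below is a Python re-expression of the three patterns from the hunk above, not the package's code. The rest of the Emacs Lisp function is not shown in this diff, so the helper name split_numbers, its sep parameter, and the assumption that a separator is inserted at each number/non-number boundary are all hypothetical.

```python
import re

# Patterns transcribed from the hunk above (Emacs Lisp -> Python syntax).
OLD_NUM = r"([\u0e50-\u0e59]+)"              # 0.5: Thai digits only
NEW_NUM = r"([0-9\u0e50-\u0e59]+)"           # 0.6: ASCII digits and Thai digits
NONNUM = r"([\u0e00-\u0e4f\u0e5a-\u0e7f])"   # Thai block minus the Thai digits


def split_numbers(text, num=NEW_NUM, sep=" "):
    """Hypothetical helper: put sep between a digit run and adjacent Thai text.

    Assumption: the real function does something similar with its
    trailing_rexp and leading_rexp; the diff does not show that part.
    """
    trailing = re.compile(num + NONNUM)   # digit run followed by a Thai character
    leading = re.compile(NONNUM + num)    # Thai character followed by a digit run
    text = trailing.sub(lambda m: m.group(1) + sep + m.group(2), text)
    return leading.sub(lambda m: m.group(1) + sep + m.group(2), text)


if __name__ == "__main__":
    sample = "\u0e1b\u0e352020\u0e19\u0e35\u0e49"   # Thai text with "2020" embedded
    print(split_numbers(sample, num=OLD_NUM))        # ASCII digits are not detected
    print(split_numbers(sample, num=NEW_NUM))        # "2020" is set apart
```

Under these assumptions, the 0.5 pattern leaves a string with embedded ASCII digits unchanged, while the 0.6 pattern separates the digit run on both sides. Runs of Thai digits are separated by both patterns.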