pkgsrc-Changes archive


CVS commit: pkgsrc/textproc/split-thai



Module Name:    pkgsrc
Committed By:   scole
Date:           Wed Aug 19 16:52:25 UTC 2020

Modified Files:
        pkgsrc/textproc/split-thai: Makefile
        pkgsrc/textproc/split-thai/files: thai-utility.el

Log Message:
Update to 0.6
- handle ASCII digits [0-9], in addition to Thai digits, in the emacs
  splitter's number regexp


To generate a diff of this commit:
cvs rdiff -u -r1.5 -r1.6 pkgsrc/textproc/split-thai/Makefile
cvs rdiff -u -r1.5 -r1.6 pkgsrc/textproc/split-thai/files/thai-utility.el

Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.

Modified files:

Index: pkgsrc/textproc/split-thai/Makefile
diff -u pkgsrc/textproc/split-thai/Makefile:1.5 pkgsrc/textproc/split-thai/Makefile:1.6
--- pkgsrc/textproc/split-thai/Makefile:1.5     Tue Aug 18 17:10:28 2020
+++ pkgsrc/textproc/split-thai/Makefile Wed Aug 19 16:52:25 2020
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.5 2020/08/18 17:10:28 scole Exp $
+# $NetBSD: Makefile,v 1.6 2020/08/19 16:52:25 scole Exp $
 
-PKGNAME=       split-thai-0.5
+PKGNAME=       split-thai-0.6
 CATEGORIES=    textproc
 MAINTAINER=    pkgsrc-users@NetBSD.org
 COMMENT=       Utilities to split UTF-8 Thai text into words

Index: pkgsrc/textproc/split-thai/files/thai-utility.el
diff -u pkgsrc/textproc/split-thai/files/thai-utility.el:1.5 pkgsrc/textproc/split-thai/files/thai-utility.el:1.6
--- pkgsrc/textproc/split-thai/files/thai-utility.el:1.5        Tue Aug 18 17:10:28 2020
+++ pkgsrc/textproc/split-thai/files/thai-utility.el    Wed Aug 19 16:52:25 2020
@@ -183,7 +183,7 @@ space at word boundaries. (wrapper for '
 'thai-break-words doesn't always split numbers properly. this may
 improve tokenization somewhat."
   ;; xxx this really should be fixed in 'thai-word lib
-  (let* ((num_rexp "\\([\u0e50-\u0e59]+\\)") ;; thai numbers
+  (let* ((num_rexp "\\([0-9\u0e50-\u0e59]+\\)") ;; numbers
         (nonnum_rexp "\\([\u0e00-\u0e4f\u0e5a-\u0e7f]\\)") ;; "non-numbers"
         (trailing_rexp (concat num_rexp nonnum_rexp))
         (leading_rexp (concat nonnum_rexp num_rexp)))
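
The effect of widening num_rexp can be sketched outside the package. The
helper below is hypothetical (the name my-split-thai-numbers is an
assumption, and this is not the function body from thai-utility.el); it
only reuses the two character classes from the hunk above and assumes the
surrounding code inserts a space wherever a digit run touches a Thai
non-digit character.

```elisp
;; Hypothetical helper for illustration; not part of thai-utility.el.
(defun my-split-thai-numbers (text)
  "Return TEXT with a space between digit runs and adjacent Thai characters."
  (let* ((num-rexp "\\([0-9\u0e50-\u0e59]+\\)")                ; ASCII and Thai digits (1.6)
         (nonnum-rexp "\\([\u0e00-\u0e4f\u0e5a-\u0e7f]\\)")    ; Thai block minus its digits
         (trailing-rexp (concat num-rexp nonnum-rexp))         ; digits then Thai character
         (leading-rexp (concat nonnum-rexp num-rexp)))         ; Thai character then digits
    ;; First split "digits|char", then "char|digits", keeping both groups.
    (replace-regexp-in-string
     leading-rexp "\\1 \\2"
     (replace-regexp-in-string trailing-rexp "\\1 \\2" text))))

(my-split-thai-numbers "ปี2020ครับ")   ; => "ปี 2020 ครับ"
(my-split-thai-numbers "ปี๒๕๖๓ครับ")   ; => "ปี ๒๕๖๓ ครับ"
```

With the 1.5 class "[\u0e50-\u0e59]+" the first call would return its input
unchanged, because "2020" contains no Thai digits; the second call behaves
the same under both revisions.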


