pkgsrc-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: pkg/38040 (textproc/mecab-{ipadic,jumandic} would have to provide the way to select charset)



The following reply was made to PR pkg/38040; it has been noted by GNATS.

From: Naoto Morishima <naoto%morishima.net@localhost>
To: gnats-bugs%NetBSD.org@localhost
Cc: obache%NetBSD.org@localhost,
        gnats-admin%netbsd.org@localhost,
        pkgsrc-bugs%netbsd.org@localhost
Subject: Re: pkg/38040 (textproc/mecab-{ipadic,jumandic} would have to provide 
the way to select charset)
Date: Sun, 17 Feb 2008 14:47:48 +0900

 Hummm, sorry for my ignorance abount MeCab API to select dictionaries.
 
 >>>>> On Sun, 17 Feb 2008 04:50:02 +0000 (UTC), "OBATA Akio" 
 >>>>> <obache%netbsd.org@localhost> said:
 
 obache> Namazu is only package possibly using MeCab in pkgsrc and it expect
 obache> that charset of the dictionary is euc-jp.
 
  There should be potential users who uses MeCab standalone,
  i.e. without Namazu.  The pkgsrc should not enforce them to use
  euc-jp dictionaries.
 
 obache> MeCab already can select dictionary:
 
  I believe that the pkgsrc system does not have any mechanisms for a
  package to ensure some options has been set in a package it depends.
  For instance, package 'namazu' cannot ensure 'euc-jp' option has been
  set in package 'mecab-ipadic' or something, so we cannot deal with
  'euc-jp' in mecab-ipadic as an option, right?
 
  So there should be several solutions for it;
 
   - allow mecab-ipadic (or jumandic) to install multiple dictionaries
     at the same time by introducing some options.  In this case, these
     package should guarantee to install the dictionary in euc-jp; that
     is, euc-jp should not be included to the options.
 
      pros: no need to create multiple (and possibly fiddly) packages.
      cons: the dictionary in euc-jp is always installed regardless of
            its necessity.
 
   - split mecab-ipadic (or jumandic) to multiple packages according to
     its charset.  Namazu should depend to mecab-ipadic-euc-jp.
 
      pros: users and packages can choice dictionaries to install.
      cons: need to generate and maintain similar packages.
 
  Anyway, namazu should provide configuration file which considers
  which dictionary should be used.
 
  Any other idea?
 
  BTW, mecab-base in the HEAD of pkgsrc cannot be compiled in
  3.1_STABLE and Solaris w/ sunpro.  Should I send pr for them?
 
 Regards,
 Naoto Morishima
 


Home | Main Index | Thread Index | Old Index