Lingua-RU-OpenCorpora-Tokenizer-0.05.tar.gz

Homepage: -
Package version: -
Architecture: -
Distribution: Perl-CPAN
Filename: Lingua-RU-OpenCorpora-Tokenizer-0.05.tar.gz

/Lingua-RU-OpenCorpora-Tokenizer-0.05/README

NAME
    Lingua::RU::OpenCorpora::Tokenizer - tokenizer for OpenCorpora project

SYNOPSIS
        my $tokens = $tokenizer->tokens($text);

        my $bounds = $tokenizer->tokens_bounds($text);

DESCRIPTION
    This module tokenizes input text in the Russian language.

    Note that it uses a probabilistic algorithm rather than trying to parse
    the language. It also uses some pre-calculated data fre

/Lingua-RU-OpenCorpora-Tokenizer-0.05/inc/Module/Install/ReadmeFromPod.pm

#line 1
package Module::Install::ReadmeFromPod;

use 5.006;
use strict;
use warnings;
use base qw(Module::Install::Base);
use vars qw($VERSION);

$VERSION = '0.12';

sub readme_from {
  my $self = shift;
  return unless $self->is_admin;

  my $file = shift || $self->_all_from
    or die "Can't determine file to make readme_from";
  my $clean = shift;

  print "Writing README from $file\n";

  requ

/Lingua-RU-OpenCorpora-Tokenizer-0.05/Changelog

0.05 - Mon Nov 21 2011
	- data update
	- added more POD

0.04 - Sun Nov 20 2011
	- INCOMPATIBLE CHANGE: refactored file-related code (data files are now
	stored as gzip archives rather than plaintext files)
	- INCOMPATIBLE CHANGE: tokens_bounds() now returns the zero-based index of
	the boundary instead of the position of the character after it
	- data files are now represented by classes with a proper API
	
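The 0.04 change to tokens_bounds() affects any caller that slices text by the returned positions. The sketch below illustrates only the index arithmetic. It does not load the module; split_at_bounds() is a hypothetical helper, the boundary lists are hand-written rather than real tokenizer output, and it assumes the boundaries are available as a flat list of integer character indices, which may not match the structure the module actually returns.

```perl
use strict;
use warnings;
use utf8;

# Hypothetical helper, not part of Lingua::RU::OpenCorpora::Tokenizer.
# Cuts $text after each zero-based boundary index (the convention the
# Changelog describes for 0.04 and later) and trims whitespace.
sub split_at_bounds {
    my ($text, $bounds) = @_;
    my @tokens;
    my $start = 0;

    # The final sentinel picks up any text left after the last boundary.
    for my $end ((sort { $a <=> $b } @$bounds), length($text) - 1) {
        my $chunk = substr($text, $start, $end - $start + 1);
        $chunk =~ s/^\s+|\s+$//g;
        push @tokens, $chunk if length $chunk;
        $start = $end + 1;
    }
    return \@tokens;
}

my $text = 'Привет, мир!';

# Hand-written boundaries in the pre-0.04 style: the position of the
# character after each boundary.
my @old_style = (6, 7, 11, 12);

# Converting to the 0.04+ style is a shift by one.
my @new_style = map { $_ - 1 } @old_style;    # (5, 6, 10, 11)

binmode(STDOUT, ':encoding(UTF-8)');
print join('|', @{ split_at_bounds($text, \@new_style) }), "\n";
```

Code written against 0.03 or earlier that passed the returned positions straight to substr() would be off by one character after upgrading, which is presumably why the entry is flagged as incompatible.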
