WordNet lemmatizer for R

Updated: 2024-10-26 10:28:32

Problem description

I would like to use the WordNet lemmatizer to lemmatize the words in a:

> a <- c("He saw a see-saw on a sea shore", "she is feeling cold")
> a
[1] "He saw a see-saw on a sea shore" "she is feeling cold"

I convert a into a corpus and run pre-processing steps (such as stopword removal and lemmatization):

> a <- Corpus(VectorSource(a))

I wanted to do the lemmatization in the following way:

> filter <- getTermFilter("ExactMatchFilter", a, TRUE)
> terms <- getIndexTerms("NOUN", 1, filter)
> sapply(terms, getLemma)

But I get this error:

> filter <- getTermFilter("ExactMatchFilter", a, TRUE)
Error in .jnew(paste("com.nexagis.jawbone.filter", type, sep = "."), word, :
  java.lang.NoSuchMethodError: <init>

My idea is to lemmatize the whole corpus rather than a single word. How can this be accomplished?

Recommended answer

Put your code in a loop. You can try something like this:

lapply(a, function(x) {
  # exact-match filter for this element, ignoring case
  x.filter <- getTermFilter("ExactMatchFilter", x, TRUE)
  # at most one NOUN index entry
  terms <- getIndexTerms("NOUN", 1, x.filter)
  sapply(terms, getLemma)
})
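
The error in the question presumably arises because getTermFilter expects a single character string, not a whole corpus or sentence. The following is a minimal sketch of one way to apply the lookup word by word, assuming the input is the plain character vector a (before the Corpus conversion), that the wordnet package is installed, and that a WordNet dictionary is configured. The helper names tokenize, wordnet_lemma and lemmatize_docs are hypothetical and are not part of the wordnet package. Note also that, as far as I know, ExactMatchFilter only finds words that already appear in the WordNet index, so an inflected form with no entry of its own is returned unchanged here.

```r
# Hypothetical helper: split text into lowercase word tokens,
# keeping hyphenated words such as "see-saw" intact.
tokenize <- function(text) {
  tokens <- unlist(strsplit(tolower(text), "[^a-z-]+"))
  tokens[nzchar(tokens)]
}

# Hypothetical helper: look up ONE word in the WordNet index.
# Assumes the wordnet package and a WordNet dictionary are available.
# Falls back to the word itself when no index entry is found.
wordnet_lemma <- function(word, pos = "NOUN") {
  filter <- wordnet::getTermFilter("ExactMatchFilter", word, TRUE)
  terms <- wordnet::getIndexTerms(pos, 1, filter)
  if (length(terms) == 0) word else wordnet::getLemma(terms[[1]])
}

# Hypothetical helper: apply a per-word lookup to every word of every
# document. `lookup` is any function mapping one word to one string.
lemmatize_docs <- function(docs, lookup = wordnet_lemma) {
  lapply(docs, function(doc) {
    vapply(tokenize(doc), lookup, character(1), USE.NAMES = FALSE)
  })
}

# Usage (needs a working WordNet setup):
# a <- c("He saw a see-saw on a sea shore", "she is feeling cold")
# lemmatize_docs(a)
```

Passing the lookup as an argument keeps the tokenizing loop independent of WordNet, so a different part of speech or a different lemmatizer can be swapped in without touching the loop.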

Published: 2023-11-25 07:29:46

Article link: https://www.elefans.com/category/jswz/34/1628865.html