Fix pseudo-corpus format #17

Open
Invidia19 wants to merge 1 commit into qiang2100:master from Invidia19:pseudocorpus

Invidia19 commented May 24, 2020

Based on https://arxiv.org/pdf/1412.5404.pdf, each line of the pseudo-corpus should contain one word's list of adjacent words.

This is also based on the code linked below (in the file PrepareInput.java):
https://figshare.com/articles/Code_of_word_network_topic_model/5572591
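
To illustrate the format described above, here is a minimal Python sketch. It is not the code from this PR or from PrepareInput.java. The function names `build_pseudo_corpus` and `format_pseudo_corpus` are hypothetical, and it assumes that two words count as adjacent when they appear in the same short document, that a neighbour is repeated once per co-occurrence, and that the head word itself is not written on its line.

```python
from collections import defaultdict


def build_pseudo_corpus(docs):
    """Map each word to the list of words that co-occur with it.

    Assumption: the whole short document is treated as one window,
    so every other token in the document counts as adjacent.
    """
    adjacent = defaultdict(list)
    for doc in docs:
        tokens = doc.split()
        for i, word in enumerate(tokens):
            for j, other in enumerate(tokens):
                if i != j:  # skip the token's own position
                    adjacent[word].append(other)
    return dict(adjacent)


def format_pseudo_corpus(adjacent):
    # One line per word (in sorted word order): its adjacent-word list.
    return [" ".join(adjacent[word]) for word in sorted(adjacent)]


docs = ["apple banana", "banana cherry"]
lines = format_pseudo_corpus(build_pseudo_corpus(docs))
for line in lines:
    print(line)
```

For the two toy documents, the line for `banana` is `apple cherry`, because `banana` co-occurs with `apple` in the first document and with `cherry` in the second.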
