Google N-Gram-Patterns

Google N-Gram-Patterns seeks to build a co-occurrence network based on n-gram data provided by Google Inc.
Download

Google N-Gram-Patterns Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL
  • Price:
  • FREE
  • Publisher Name:
  • Anurag Jain, Bin Lan, Darshan Paranjape and Vishnu Praveen Pedir
  • Publisher web site:

Google N-Gram-Patterns Tags


Google N-Gram-Patterns Description

Google N-Gram-Patterns seeks to build a co-occurrence network based on n-gram data provided by Google Inc. Google N-Gram-Patterns seeks to build a co-occurrence network based on n-gram data provided by Google Inc. This project presents an easy and fast way to analyze Google n-gram data, which is contributed by Google Inc.Google n-gram data consists of a huge amount of word information based on real life searching queries entered by internet users. The huge amount of data makes it so hard to analyze the whole data set. In this project, we present a possible parallel solution to build and access co-occurrence network using Google n-gram data.Moreover, we use the co-occurrence network to find relationship (path) between words in this large corpus. We also build a common library based on C/MPI for all the similar co-occurrence network analysis programs. This method was tested on both Blade system and Altix system from MSI at University of Minnesota Twin City campus.


Google N-Gram-Patterns Related Software