-
Universidad Nacional de Colombia
- Bogotá
Highlights
- Pro
Stars
Machine-readable lists of lemma-token pairs in 23 languages.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
A Study of Graph-based System for Multi-view Clustering
Provides functions for hierarchical latent tree analysis on text data for hierarchical topic detection
An implementation of an AssetBundle for use in Dropwizard that allows user configuration.
TensorFlow code and pre-trained models for BERT
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
Topic modeling with latent Dirichlet allocation using Gibbs sampling
high-performance graph database for real-time use cases
A library for creating isolated embedded databases for Spring-powered integration tests.
A tool for extracting plain text from Wikipedia dumps
A simple library class which helps with loading dynamic JNI libraries stored in the JAR archive
turtles, patches, and links for kids, teachers, and scientists
scikit-learn: machine learning in Python
Apache Spark - A unified analytics engine for large-scale data processing
💫 Industrial-strength Natural Language Processing (NLP) in Python
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources
Apache TinkerPop - a graph computing framework
Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Models and examples built with TensorFlow
brat rapid annotation tool (brat) - for all your textual annotation needs

