Quora-Insincere-Questions-Classification

Problem Statement

An existential problem for any major website today is how to handle toxic and divisive content. Quora is a platform that empowers people to learn from each other: people ask questions and connect with others who contribute unique insights and quality answers. A key challenge is to weed out insincere questions, meaning those founded upon false premises or intended to make a statement rather than to seek helpful answers. In this Kaggle competition, participants develop models that identify and flag insincere questions.

Libraries

scikit-learn
Keras
TensorFlow
NumPy

Files

Quora_data_exploration.ipynb performs basic data exploration
Quora_GRU_no_pretrain_embeddings.ipynb builds a Gated Recurrent Unit (GRU) neural network model without pretrained word embeddings for the text classification task
Quora_GRU_with_pretrained_embeddings.ipynb builds a Gated Recurrent Unit (GRU) neural network model with pretrained word embeddings (GloVe)
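
The difference between the two GRU notebooks is where the embedding weights come from. As a rough illustration of the pretrained case, the sketch below parses GloVe-style text (one token per line followed by its vector components) and fills an embedding matrix indexed by a tokenizer vocabulary. The function names, the toy vectors, and the `word_index` dictionary are hypothetical and are not taken from the notebooks.

```python
import io

import numpy as np


def load_glove_vectors(handle):
    """Parse GloVe-style text: a token followed by its float components."""
    vectors = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = np.asarray(parts[1:], dtype="float32")
    return vectors


def build_embedding_matrix(word_index, vectors, dim):
    """Row i holds the vector of the word with index i.

    Index 0 (assumed here to be padding) and words missing from the
    pretrained vocabulary keep all-zero rows.
    """
    matrix = np.zeros((len(word_index) + 1, dim), dtype="float32")
    for word, idx in word_index.items():
        vec = vectors.get(word)
        if vec is not None:
            matrix[idx] = vec
    return matrix


# Toy stand-in for a real GloVe file (assumption: 3-dimensional vectors).
toy_glove = io.StringIO("why 0.1 0.2 0.3\nis 0.4 0.5 0.6\n")
word_index = {"why": 1, "is": 2, "quora": 3}  # hypothetical tokenizer output

vectors = load_glove_vectors(toy_glove)
embedding_matrix = build_embedding_matrix(word_index, vectors, dim=3)
print(embedding_matrix.shape)
```

In Keras, a matrix like this is typically used to initialize the Embedding layer in front of the GRU, for example through a constant initializer. Whether the layer is then frozen or fine-tuned is a modeling choice this sketch does not settle.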

Results

The evaluation metric for this competition is the F1 score.
The GRU without pretrained word embeddings achieves a private score of 0.65286 (ranked 1242/4037, top 30.8%).
With pretrained word embeddings, the score improves to 0.67470 (ranked 1182/4037, top 29%).
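
For reference, the F1 score is the harmonic mean of precision and recall on the positive (insincere) class. The sketch below computes it in plain Python and, as an added illustration, picks the probability cutoff that maximizes F1 on a small made-up validation set. The threshold search, the helper names, and the toy data are assumptions and do not describe what the notebooks do. scikit-learn's `f1_score` computes the same metric.

```python
def f1_score_binary(y_true, y_pred):
    """F1 for the positive class (label 1) from two 0/1 sequences."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision or recall is zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(y_true, probs, candidates):
    """Return (threshold, f1) for the first candidate with the highest F1."""
    best_t, best_f1 = None, -1.0
    for t in candidates:
        preds = [int(p >= t) for p in probs]
        score = f1_score_binary(y_true, preds)
        if score > best_f1:
            best_t, best_f1 = t, score
    return best_t, best_f1


# Made-up validation labels and predicted probabilities (illustration only).
y_val = [0, 0, 1, 1, 0, 1]
val_probs = [0.1, 0.4, 0.35, 0.8, 0.2, 0.6]

threshold, f1 = best_threshold(y_val, val_probs, candidates=[0.3, 0.5, 0.7])
print(threshold, round(f1, 4))
```

Because insincere questions are the minority class, the cutoff that maximizes F1 is often not 0.5, which is why a search like this is commonly run on held-out data.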

Next step

The results may be further improved by spell checking the question text.
Another option is to use multiple pretrained word embeddings (including GoogleNews and wiki-news-300d-1M) and to combine the classifications of multiple recurrent neural network models.
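
One simple way to combine several models is to average their predicted probabilities before thresholding. The sketch below assumes each model (for example, one per pretrained embedding) has already produced a probability per question. The function name, variable names, and numbers are hypothetical, and averaging is only one of several possible ensembling strategies.

```python
import numpy as np


def average_ensemble(prob_lists, weights=None):
    """Blend per-model probabilities; input shape is (n_models, n_samples)."""
    probs = np.asarray(prob_lists, dtype="float64")
    return np.average(probs, axis=0, weights=weights)


# Hypothetical predictions from two models trained on different embeddings.
glove_model_probs = [0.9, 0.2, 0.6]
wiki_news_model_probs = [0.7, 0.4, 0.3]

blended = average_ensemble([glove_model_probs, wiki_news_model_probs])
labels = (blended >= 0.5).astype(int)  # 0.5 is an assumed cutoff
print(blended, labels)
```

Passing `weights` lets a stronger model count for more in the blend. The weights and the cutoff would normally be chosen on validation data.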
