Skip to content

What does it mean to have CNN as both model and tok2vec? #12260

Discussion options

You must be logged in to vote

This comes down to the details of the "listener" mechanism. Rest assured that we don't love this part either --- I tried really hard to find a better solution, and unfortunately I still don't have one.

The listener is a way for multiple components to share weights. So you can have a textcat and a POS tagger, and they both get the same token vectors, and those token vectors will be updated by gradients from both components.

Consider the following two layer definitions:

Replies: 2 comments 5 replies

Comment options

You must be logged in to vote
3 replies
@bpben
Comment options

@honnibal
Comment options

@bpben
Comment options

Answer selected by adrianeboyd
Comment options

You must be logged in to vote
2 replies
@bpben
Comment options

@kadarakos
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / textcat Feature: Text Classifier feat / tok2vec Feature: Token-to-vector layer and pretraining feat / config Feature: Training config
3 participants