Discussions
Categories
Groups
Community Home
Categories
INTERNAL ENABLEMENT
POPULAR
THRUST SERVICES & TOOLS
CLOUD EDITIONS
Quick Links
MY LINKS
HELPFUL TIPS
Back to website
Home
Web CMS (TeamSite)
How To share the Tokenization Process
jonak
How do i share the Tokenization process when there are Multiple recognizer are running for different fields.
This Problem was addressed in the MetaTagger User’s Guide Release 4.1.1
Page 31 under the heading "Scenario Five - Leverage Existing Tokenization"
But still I am unable to figure out in manage content processor where should I enable Share Tokenization.
Please help
Find more posts tagged with
Comments
Migrateduser
To share Tokenization the rack script parse step need to be identical. Best way to do this is to isolate all "projects" into one content processor per content type, and let all other fields for that content type be populated by another content processor.
So two txt content processors.
One containing the two recognizes with the racke script "parse; recognize; resolve;
One containing all other fileds - title, keywords...
This will work. If you have other types of projects (summarizers or classifiers - each would have to be separated), Benefit is shared parsing of the document.
jonak
Thanks mkimbrell,
I have 3 extractors to configure.
The parse script for all the three are as follows......
1. parse -richTokens -case -useNoLexicon ;extract -config A1.mtx.xml
2. parse -richTokens -case -useNoLexicon ;extract -config A2.mtx.xml
3. parse -richTokens -case -useNoLexicon ;extract -config A3.mtx.xml
So how to go about it.
More over in the CIADMIN GUI I could not find any check box to enable the shared tokenization process