THRON comes out-of-the-box with a powerful semantic engine, capable of analyzing content the moment they are published into platform and automatically creating and assigning tags related to their topic. The elasticity of the engine in tags assignation is entirely configurable through system settings.
Tags created by the Semantic engine are "not categorized" (further information can be found here). Semantic engine will not generate multiple tags with the same name: if a tag already exists it will be associated to the content. Replacing content's version will trigger the update of tags associated by the engine.
Note 1: All the actions performed by the engine over tags (creation and association to content, removal from content) will only affect engine-generated tags unless they have been manually added to content.
Semantic engine is available for TOPIC class only. It cannot be enabled on TARGET class. It operates on the following content types:
- Documents: tags will be assigned according to engine configuration and the number of times each word is repeated. Supported filetypes are: txt, html, doc, docx, pdf, ppt, pptx. Supported languages are: English, Dutch, French, German, Italian, Polish, Portugese, Russian, Spanish, Swedish.
- Images: tags will be assigned according to elements represented in the picture.
Note 2: If a document has a combination of images and text, only the text will be processed by the semantic engine. If a pdf file type does not have "pure" text but text as an image, this will not be processed by the semantic engine.
Here is an example on how the engine works: this is a document about THRON CDN performances, processed by the engine with different configurations. Have a look at the result:
These tags have been extracted with engine recognition reliability at 50%:
These tags have been extracted with engine recognition reliability at 60%:
These tags have been extracted with engine recognition reliability at 70%:
These tags have been extracted with engine recognition reliability at 80% and more:
Note 3: Any word present in the title or description of the content (or body if it is a document), and preceded by the special character "#", will be treated by the semantic engine as a tag.