This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
start [2019/07/04 05:46] 182.72.26.6 |
start [2025/10/13 06:30] (current) 85.208.84.179 1 |
||
|---|---|---|---|
| Line 6: | Line 6: | ||
| **Methodology** | **Methodology** | ||
| + | |||
| The logic behind Guided Search is to classify all the courses in an enterprise based on the title and/or short description of each course. Classification is performed through IBM Watson's Natural Language Understanding (NLU) suite. The NLU suite can analyse the presented text and its metadata to extract concepts, entities and related categories through a simple API. The categories are returned as a multi-level hierarchy and can be further tuned and processed. | The logic behind Guided Search is to classify all the courses in an enterprise based on the title and/or short description of each course. Classification is performed through IBM Watson's Natural Language Understanding (NLU) suite. The NLU suite can analyse the presented text and its metadata to extract concepts, entities and related categories through a simple API. The categories are returned as a multi-level hierarchy and can be further tuned and processed. | ||
| Line 11: | Line 12: | ||
| **Workflow** | **Workflow** | ||
| + | |||
| + | {{:course_search.jpg|}} | ||
| Please find below the link to Guided Search Workflow. | Please find below the link to Guided Search Workflow. | ||
| + | |||
| https://drive.google.com/file/d/1ph11m0tPFEv5HDb1tcGu1SyktJVsKpER/view?usp=sharing | https://drive.google.com/file/d/1ph11m0tPFEv5HDb1tcGu1SyktJVsKpER/view?usp=sharing | ||
| + | **Technical Specifications** | ||
| + | |||
| + | Pentaho ETL performs: | ||
| + | - Data extraction | ||
| + | - Text clean-up | ||
| + | - Generation of categories using IBM Watson NLU | ||
| + | - Generation of related keywords through Google Keyword Planner | ||
| + | |||
| + | Cassandra Big Data DB | ||
| + | - Stores the generated categories and keywords as tokens | ||
| + | |||
| + | SOLR Collection | ||
| + | - Indexes the generated tokens and their corresponding Course IDs | ||
| + | |||
| + | Python Search Program | ||
| + | - Searches the SOLR index as a multi-step process: it first displays the subcategories that match the search term, then the courses that belong to the selected subcategory | ||
| + | |||
| + | **Sample Input/Output** | ||
| + | |||
| + | 1. Enter Search Term: | ||
| + | : abb motors | ||
| + | |||
| + | Subcategory | ||
| + | |||
| + | Business Operations | ||
| + | |||
| + | Hardware | ||
| + | |||
| + | Manufacturing | ||
| + | |||
| + | Software | ||
| + | |||
| + | Metals | ||
| + | |||
| + | Computer certification | ||
| + | |||
| + | Cars | ||
| + | |||
| + | Enter Subcategory | ||
| + | :business operations | ||
| + | |||
| + | Courses that match your chosen category are: | ||
| + | 6166 | ||
| + | |||
| + | 6167 | ||
| + | |||
| + | 6168 | ||
| + | |||
| + | 2. Enter Search Term: | ||
| + | :service engineer | ||
| + | |||
| + | Subcategory | ||
| + | |||
| + | Hardware | ||
| + | |||
| + | Business Operations | ||
| + | |||
| + | Certified pre-owned | ||
| + | |||
| + | energy | ||
| + | |||
| + | Enter Subcategory: | ||
| + | energy | ||
| + | |||
| + | Courses that match your chosen category are: | ||
| + | 18616 | ||
| + | 1 | ||