Last edited one week ago
by Margit Link-Rodrigue

Reference:WikiRAG

Overview
Description:
Status: stable
License:
Developer: Hallo Welt!
Distribution: BlueSpice
Category: Communication
Edition: BlueSpice pro, BlueSpice farm, BlueSpice cloud 5.2+

Function

The WikiRAG extension provides wiki data for RAG (Retrieval-Augmented Generation) applications.

Data is not indexed to targets directly. Instead, it is first scheduled and then processed by a job that runs at regular intervals. Change observers are normally responsible for detecting indexable changes and scheduling them for processing. Depending on what changed, a change observer schedules only the relevant DataProviders for execution.
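
The flow described above can be pictured with a minimal sketch. It is not the extension's actual code (WikiRAG is a MediaWiki extension, this is plain Python), and every name in it (Schedule, observe_change, run_job, the change types and the provider names) is a hypothetical stand-in chosen for illustration. It only shows the idea that observers record work, and that a later job run sends the selected provider output to the target.

```python
from dataclasses import dataclass, field

# Hypothetical mapping from a kind of change to the providers that must re-run.
# The provider names are illustrative, not the extension's actual DataProviders.
PROVIDERS_BY_CHANGE = {
    "content": ["page-text", "metadata"],
    "move": ["metadata"],
    "file-upload": ["file-text", "metadata"],
}


@dataclass
class Schedule:
    """Pending work: page title -> set of provider names to run."""
    pending: dict = field(default_factory=dict)

    def add(self, page, providers):
        self.pending.setdefault(page, set()).update(providers)


def observe_change(schedule, page, change_type):
    """Change observer: records what has to be re-exported, exports nothing."""
    schedule.add(page, PROVIDERS_BY_CHANGE.get(change_type, []))


def run_job(schedule, target):
    """Periodic job: drains the schedule and writes provider output to the target."""
    for page, providers in sorted(schedule.pending.items()):
        for provider in sorted(providers):
            target.append((page, provider))
    schedule.pending.clear()


schedule = Schedule()
target = []
observe_change(schedule, "Main Page", "content")
observe_change(schedule, "Main Page", "move")
observe_change(schedule, "File:Report.pdf", "file-upload")
# At this point nothing has reached the target; only the schedule was updated.
run_job(schedule, target)
```

In this sketch, two changes to the same page before the job runs collapse into one set of providers, which is one plausible reason for scheduling instead of indexing on every change.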

Types of pages being indexed

Not all pages are suitable for indexing. The IndexabilityChecker makes this determination; it is registered as the WikiRAG._IndexabilityChecker service and is also accessible through the Scheduler::canPageBeScheduled method. By default, the following pages are indexed:

  • Non-talk content pages
  • File pages where file types are OFFICE or TEXT (docx, doc, odt, pdf, txt, md, ...)
  • Any page allowed by the WikiRAGCanBeIndexed hook
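
The three default rules can be sketched as a single predicate. This is an illustration under stated assumptions, not the IndexabilityChecker itself: the Page class, can_be_indexed, and the hook_handlers parameter are hypothetical names, the set of content namespaces is assumed to contain only the main namespace, and the hook is modelled as a list of callbacks that may allow additional pages. The namespace IDs (0 for main, 6 for files, odd IDs for talk namespaces) follow MediaWiki's standard numbering.

```python
from dataclasses import dataclass
from typing import Optional

NS_MAIN = 0  # MediaWiki main namespace
NS_FILE = 6  # MediaWiki file namespace
# Assumption: only the main namespace is configured as a content namespace.
CONTENT_NAMESPACES = {NS_MAIN}
INDEXABLE_MEDIA_TYPES = {"OFFICE", "TEXT"}


@dataclass
class Page:
    title: str
    namespace: int
    media_type: Optional[str] = None  # only meaningful for file pages


def is_talk(namespace):
    # In MediaWiki, talk namespaces have odd, positive IDs.
    return namespace > 0 and namespace % 2 == 1


def can_be_indexed(page, hook_handlers=()):
    # Rule 1: non-talk content pages.
    if not is_talk(page.namespace) and page.namespace in CONTENT_NAMESPACES:
        return True
    # Rule 2: file pages whose media type is OFFICE or TEXT.
    if page.namespace == NS_FILE and page.media_type in INDEXABLE_MEDIA_TYPES:
        return True
    # Rule 3: anything a hook handler explicitly allows (hypothetical model of the hook).
    return any(handler(page) for handler in hook_handlers)


article = Page("Installation guide", NS_MAIN)
talk = Page("Talk:Installation guide", 1)
pdf = Page("File:Manual.pdf", NS_FILE, "OFFICE")
image = Page("File:Logo.png", NS_FILE, "BITMAP")


def allow_logo(page):
    # Hypothetical handler that opts one extra page in.
    return page.title == "File:Logo.png"
```

With these definitions, can_be_indexed(image) is False by default, while can_be_indexed(image, [allow_logo]) is True because the handler opts that page in.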

Prevent page indexing with a magic word

Use the __NO_RAG_EXPORT__ magic word on a page to prevent it from being indexed.
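
As a rough illustration of the effect, the sketch below filters a set of pages by the presence of the magic word. It is a simplification and not how the extension evaluates the magic word: the helper name is_export_blocked and the sample pages are hypothetical, and a plain substring test would also match occurrences that a real wikitext parser ignores (for example inside nowiki tags or comments).

```python
NO_RAG_EXPORT = "__NO_RAG_EXPORT__"


def is_export_blocked(wikitext):
    # Simplified check: plain substring test on the raw wikitext.
    return NO_RAG_EXPORT in wikitext


# Hypothetical sample pages: title -> wikitext.
pages = {
    "Public handbook": "== Setup ==\nInstall the package.",
    "Internal notes": "__NO_RAG_EXPORT__\nDraft, do not export.",
}

exportable = sorted(
    title for title, text in pages.items() if not is_export_blocked(text)
)
```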

Technical information