Guest login

    If needed, for guest access to Persagen.org please reload the page and enter the following credentials as directed in the login popup.

      Username: guest
       Password: @Guest123

    Reason: the backend for the landing / search page on Persagen.org is Pg-Solr, a SSL/TLS-secured open-source enterprise-search platform based on Apache Solr. The TLS protocol (*(web security) requires a user/password login.


    Popups | JavaScript

    If you encounter website issues, please check your browser extensions; e.g. Firefox:

      •      Popups: Overlay Remover Auto | ...

      • JavaScript: uBlock Origin | Disable JavaScript | NoScript | ...).

    If that does not solve the issue please check your browser settings; e.g.:

      • how to enable JavaScript in your web browser;

      • how to Activate JavaScript on an Android Phone or Tablet;

      • how to Allow Pop–ups | Allow/block popups

    Recommendations. Persagen.org is best used / viewed on a desktop or computer with a standard monitor. Access on mobile devices may be suboptimal.

    Some Android versions appear to be problematic (likely a permissions issue re: JavaScript or display styles?) - while others work (outstanding issue for future resolution).


    Search help

    Mentioned above, the backend search engine on Persagen.org  (Pg-Solr) is based on Apache Solr  [Wikipedia: Apache Solr]. Pg-Solr enables powerful matching capabilities including phrases, wildcards, joins, grouping and much more.

    Search tips

    Basic searching

  • *:* will retrieve all indexed documents

  • Short words (less than three characters) are ignored.

  • Searches are case-insensitive, e.g. canada will find Canada. Canadian, ...

  • Stemmed searches are enabled: e.g. "Canad*" will find "Canada", "Canadian", ...

  • Advanced searching

    By default, Apache Solr operates as a NoSQL database, employing a self-"managed-schema".

    Pg-Solr, however, uses a custom schema - providing structure to the indexed content, as well as fine-grained capture and searching of document metadata (authors, dates, keywords, keyphrases, named entities, ...).

  • Among the metadata fields captured is "title" (i.e., document titles). Thus, to to find indexed documents containing Canada in the title, search for: title:canada.

  • Likewise, to find documents with

  • With the current (2023-05-10) documents indexed, title:"supreme court" and title:"supreme" AND title:"court" will give the same results,

      pg-solr-screenshot-supreme_court-2023-05-10.png

    Add:

    • The Extended DisMax (eDismax) Query Parser
      • In addition to supporting all the DisMax query parser parameters, Extended Dismax:

        • supports Solr's standard query parser syntax such as (non-exhaustive list):
          • boolean operators such as AND (+, &&), OR (||), NOT (-).
          • optionally treats lowercase "and" and "or" as "AND" and "OR" in Lucene syntax mode
          • optionally allows embedded queries using other query parsers or functions
        • includes improved smart partial escaping in the case of syntax errors; fielded queries, +/-, and phrase queries are still supported in this mode.
        • improves proximity boosting by using word shingles; you do not need the query to match all words in the document before proximity boosting is applied.
        • includes advanced stopword handling: stopwords are not required in the mandatory part of the query but are still used in the proximity boosting part. If a query consists of all stopwords, such as "to be or not to be", then all words are required.
        • includes improved boost function: in Extended DisMax, the boost function is a multiplier rather than an addend, improving your boost results; the additive boost functions of DisMax (bf and bq) are also supported.
        • supports pure negative nested queries: queries such as +foo (-foo) will match all documents.
        • lets you specify which fields the end user is allowed to query, and to disallow direct fielded searches.
        • ...

    • The Standard Query Parser
      • A query to the standard query parser is broken up into terms and operators. There are two types of terms: single terms and phrases.
        • A single term is a single word such as "test" or "hello"
        • A phrase is a group of words surrounded by double quotes such as "hello dolly"
      • Multiple terms can be combined together with Boolean operators to form more complex queries (as described below).
      • ...

    Help

    Help - Searches

    Searching in BT-Solr employs Apache Solr as the back-end (hidden) document indexer / document store.

    While the default functionality on the (main tabs) "Search" page works intuitively and well, the following provide tips for more advanced searches.

    Solr Leading Wildcard Queries

    • E.g., *tion
    • Not available; see (e.g.) this article.
    • Update [2021-03-04; BT-Solr (ResultWidget) v.2,8] Both leading and trailing wildcard queries now working! E.g., "*alifa*" finds "Halifax", "Califano".

    Help - NLP [Natural Language Processing]

    Keywords

    • Keywords are words or concepts associated with significant content, topics and categories. E.g.:
      • science;
      • business;
      • cats;
      • dogs;
      • weather;
      • vegetables;
      • newspapers;
      • songs;
      • videos;
      • ...

    Nouns

    • Nouns - nominally persons, places, things - are useful for categorizing and organizing collections of texts (corpora), and providing structure to relational graph databases (graphs → knowledge graphs).

    Named Entities | Proper Names

    • Victoria Stuart
    • Canada
    • Google