
Last pushed: 2 years ago
Short Description
SearchBlox is an enterprise search server with built-in connectors for multiple data sources.
Full Description

#How to use this image?

To download this image, run the command below.

docker pull searchblox/searchblox:v8.3.1

You can start SearchBlox with its default configuration by running the following command.

docker run -d -i -t -p 80:80 searchblox/searchblox:v8.3.1 /opt/searchblox/startSearchBlox

The above command starts the SearchBlox server on port 80.
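
The flags in the run command are standard Docker options: `-d` runs the container detached, `-i` keeps standard input open, `-t` allocates a pseudo-terminal, and `-p 80:80` publishes container port 80 on host port 80. If host port 80 is already in use, only the left-hand side of the mapping needs to change. The following is a minimal sketch of that idea; the helper name `searchblox_run_cmd` and the example port 8080 are illustrative assumptions, not part of the image.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the image): print the docker run
# command for a chosen host port. The container side stays on port 80.
searchblox_run_cmd() {
  host_port="${1:-80}"   # default to host port 80 when no argument is given
  printf 'docker run -d -i -t -p %s:80 searchblox/searchblox:v8.3.1 /opt/searchblox/startSearchBlox\n' "$host_port"
}

# Print the command that would publish SearchBlox on host port 8080.
searchblox_run_cmd 8080
```

The function only prints the command. To start the container, run the printed line yourself or pipe the output to `sh`.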

#What is SearchBlox?

SearchBlox is a powerful enterprise search and analytics server with built-in connectors, support for 32 document formats, and search in 37 languages. SearchBlox is ready to use for search, analytics and visualization without any external requirements or dependencies. Search / Analyze / Visualize any data source with SearchBlox.

##Out of the box content connectors for 25+ Repositories

  • Index Websites with secure crawling through HTTP Basic Auth, NTLM, Forms and Secure Certificates including HTTPS sites and proxy support.
  • Index file systems with local, network and cloud folders including Amazon S3, Google Drive and Dropbox
  • Database crawling for MySQL, MS SQL Server, Oracle, PostgreSQL and Sybase
  • Index CMIS compatible Content Management Systems
  • Index SharePoint with security credentials
  • Index Alfresco, Documentum, FileNet and Meridio
  • Index Atlassian JIRA
  • Index Hadoop and GridFS data
  • Index Email servers and Outlook PST Archives
  • Index Wikis

##Search User Features

  • Search across one or more collections and get federated search results.
  • Automatic Clustered Search for fast access to the right information.
  • Advanced Search – Filter by file format, language, keywords and date.
  • Faceted Search – Filter by term values, number or date range values and date histogram values.
  • Synonyms – Synonyms can be customized for every collection.
  • Spelling Suggestions – Based on terms from each collection.
  • Auto Suggestions or Typeahead suggestions – Get suggestions as you type.
  • Date Range search – filter search results by a particular date range.
  • Automatic highlighting of user search query terms in HTML and PDF documents.
  • Keyword-in-Context – search result description is displayed with content where the term occurs.
  • User-defined number of search results per page.
  • Simple and Advanced Query Syntax for your most basic to complex search requirements.
  • Support for Boolean AND, OR, NOT, Fuzzy, Wildcard, Proximity and custom field searches.
  • Sort – results can be sorted by date, relevance, alphabetically or any custom field in ascending or descending order.
  • Hit Highlighting – query terms are highlighted on search results.
  • Collections – users can search specific collections or across all collections.
  • Email Viewer – View/Export (PDF) your Outlook email messages in our web based viewer.
  • Email Alert – Set up keyword alerts for new content that is discovered.
  • Tag Cloud – View tag cloud for the popular keywords.
  • Top Clicked Documents/Pages – View the most clicked search results.

##Administrator Features

  • Easy to use and intuitive web console to manage all aspects of the Search application
  • Featured Results – Highlight links in the search results page when the user enters specific search terms
  • Fast deployment of clustered search results using the built-in clustering engine
  • Built-in Replication to synchronize search indexes across multiple instances of SearchBlox
  • Collections – create unlimited document collections with customized settings
  • Branding – customize search results using CSS or XSL stylesheets; XML/JSON data is available for complete control of search results
  • Built-in connectors to index Websites, File System, Emails (PST files), Database, MongoDB, Twitter, Google Drive, Amazon S3, CSV and RSS Feed content
  • Built-in file serving of documents in File System Collections and Outlook PST archives (including attachments)
  • Support for indexing content through Proxy Servers
  • Support for crawling the URLs listed in a Google Sitemap
  • Selective indexing of sections of HTML pages using <noindex> </noindex> or <!--stopindex--> <!--startindex--> tags
  • Protected Content – crawlers can index content protected with Basic HTTP and Form-Based Authentication
  • Advanced Reporting – real-time reporting of top queries and zero-match queries on a per-collection basis
  • On-Demand & Scheduled Indexing of content
  • Check for duplicate documents during indexing
  • Addition and Deletion of individual documents from the index
  • Disable stemming for individual indexes
  • API – Create, Update, Modify and Delete collections programmatically
  • Ready to deploy jQuery and AngularJS faceted search plugins for use on any website, intranet or application
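
To make the selective indexing feature in the list above concrete, the sketch below shows the effect of wrapping part of a page in stop/start markers. It assumes the markers are the HTML comments `<!--stopindex-->` and `<!--startindex-->`, each on its own line. The sample page and the `indexable_text` helper are hypothetical. The `sed` filter only imitates the outcome and is not how SearchBlox itself processes pages.

```shell
#!/bin/sh
# Hypothetical sample page: the navigation block should stay out of the index.
page='<html><body>
<h1>Product guide</h1>
<!--stopindex-->
<div id="nav">Home | Products | Contact</div>
<!--startindex-->
<p>Installation steps for the product.</p>
</body></html>'

# Illustration only: delete every line from the stop marker through the
# start marker (inclusive), leaving the content a crawler would index.
indexable_text() {
  printf '%s\n' "$1" | sed '/<!--stopindex-->/,/<!--startindex-->/d'
}

indexable_text "$page"
```

With this sample, the heading and the installation paragraph remain in the output, while the navigation `div` and both markers are dropped.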