MongoDB Text Search Tutorial

10.1.2013 | 7 minutes of reading time

In my introduction to text search in MongoDB , we had a look at the basic features. Today we’ll have a closer look at the details.

API

You may have noticed that a text search is not executed with a find() command. Instead you call

1db.foo.runCommand( "text", {search: "bar"} )

Remember it’s an experimental feature still. Adding it to the implementation of the find() command would have mixed critical production code with the new text search feature. When executed via a runCommand() call, text search can be run and tested in isolation.

I expect to see a new query operator like $text or $textsearch as soon as text search is integrated with the standard find() command.

Text Query Syntax

In the previous examples we just searched for a single word. We can do more than that. Let’s have a look at the following example:

1db.foo.drop()
2db.foo.ensureIndex( {txt: "text"} )
3db.foo.insert( {txt: "Robots are superior to humans"} )
4db.foo.insert( {txt: "Humans are weak"} )
5db.foo.insert( {txt: "I, Robot - by Isaac Asimov"} )

A search for “robot” will find two documents, the same it true for “human”:

1> db.foo.runCommand("text", {search: "robot"}).results.length
22
3> db.foo.runCommand("text", {search: "human"}).results.length
42

When searching for multiple terms, an OR search is performed, yielding three documents in our example:

1> db.foo.runCommand("text", {search: "human robot"}).results.length
23

I would have expected that the given search words are AND-ed not OR-ed.

Negation

By adding a heading minus sign to a search word, you can exclude documents containing that word. Let’s say, we want all documents on “robot” but no “humans”.

1> db.foo.runCommand("text", {search: "robot -humans"})
2{
3        "queryDebugString" : "robot||human||||",
4        "language" : "english",
5        "results" : [
6                {
7                        "score" : 0.6666666666666666,
8                        "obj" : {
9                                "_id" : ObjectId("50ebc484214a1e88aaa4ada0"),
10                                "txt" : "I, Robot - by Isaac Asimov"
11                        }
12                }
13        ],
14        "stats" : {
15                "nscanned" : 2,
16                "nscannedObjects" : 0,
17                "n" : 1,
18                "timeMicros" : 212
19        },
20        "ok" : 1
21}

Phrase Search

By enclosing multiple words inside quotes (“foo bar”) you perform a phrase search . Inside a phrase, order is important and stop words are also taken into account:

1> db.foo.runCommand("text", {search: '"robots are"'})
2{
3        "queryDebugString" : "robot||||robots are||",
4        "language" : "english",
5        "results" : [
6                {
7                        "score" : 0.6666666666666666,
8                        "obj" : {
9                                "_id" : ObjectId("50ebc482214a1e88aaa4ad9e"),
10                                "txt" : "Robots are superior to humans"
11                        }
12                }
13        ],
14        "stats" : {
15                "nscanned" : 2,
16                "nscannedObjects" : 0,
17                "n" : 1,
18                "timeMicros" : 185
19        },
20        "ok" : 1
21}

Please have a look at the “queryDebugField”:

1"queryDebugString" : "robot||||robots are||"

It tells us that our search string contains one stem “robot” but also the phrase “robots are”. That’s the reason we have only one hit. Compare that to these searches:

1> // order matters inside phrase
2> db.foo.runCommand("text", {search: '"are robots"'}).results.length
30
4> // no phrase search --> OR query
5> db.foo.runCommand("text", {search: 'are robots'}).results.length
62

Multi Language Support

Stemming and stop word filtering are both language dependent. So we have to tell MongoDB what language to use for indexing and searching if you want to use other languages than the default which is English. MongoDB uses the open source Snowball stemmer that supports these languages .

In order to use another language for indexing and searching, you do this when creating the index:

1db.de.ensureIndex( {txt: "text"}, {default_language: "german"} )

With this setting, MongoDB assumes that all text in the field “txt” and all text searches on that collection are in German. Let’s see if it works:

1> db.de.insert( {txt: "Ich bin Dein Vater, Luke." } )
2> db.de.validate().keysPerIndex["text.de.$txt_text"]
32

As you can see, there are only two index keys, so stop word filtering did occur (this time with a German stop word list. Vater is the German word for father, not some typo with Vader) Let’s try some searches:

1> db.de.runCommand("text", {search: "ich"}).results.length
20
3> db.de.runCommand("text", {search: "Vater"}).results.length
41
5> db.de.runCommand("text", {search: "Luke"}).results.length
61

Please note that we don’t have to give the language we are searching for because it is derived from the index. We have hits for the meaningful words “Vater” and “Luke”, but not for the stop word “ich” (which means “I”).

It it also possible to mix multiple languages in the same index. Each single document can have its own language:

1db.de.insert( {language:"english", txt: "Ich bin ein Berliner" } )

If a field “language” is present, its content defines the language for stemming and stop word filtering for the indexed field(s) of that document. The word “ich” is not a stop word in English, so it is indexed now.

1// default language: german -> no hits
2> db.de.runCommand("text", {search: "ich"})
3{
4        "queryDebugString" : "||||||",
5        "language" : "german",
6        "results" : [ ],
7        "stats" : {
8                "nscanned" : 0,
9                "nscannedObjects" : 0,
10                "n" : 0,
11                "timeMicros" : 96
12        },
13        "ok" : 1
14}
15 
16// search for English -> one hit
17> db.de.runCommand("text", {search: "ich", language: "english"})
18{
19        "queryDebugString" : "ich||||||",
20        "language" : "english",
21        "results" : [
22                {
23                        "score" : 0.625,
24                        "obj" : {
25                                "_id" : ObjectId("50ed163b1e27d5e73741fafb"),
26                                "language" : "english",
27                                "txt" : "Ich bin ein Berliner"
28                        }
29                }
30        ],
31        "stats" : {
32                "nscanned" : 1,
33                "nscannedObjects" : 0,
34                "n" : 1,
35                "timeMicros" : 161
36        },
37        "ok" : 1
38}

What happened here? The default language for searching is German. So the first search has no result (as before). In the second search we say to search for English text (to be more precise: for index keys that were generated with an English stemmer and stop words). That’s why we find the famous sentence from JFK.

What does that mean? Well, you have are real multi language text search at hand. You can store text messages from around the world in one collection and still search them dependent on the language.

Multiple Fields

A text index can span more that one field. If you are using more than one field, each field can have its one weight. That enables you to have indexed text parts of your document with different meanings.

1> db.mail.ensureIndex( {subject: "text", body: "text"}, {weights: {subject: 10} } )
2> db.mail.getIndices()
3[
4        ...
5        {
6                "v" : 0,
7                "key" : {
8                        "_fts" : "text",
9                        "_ftsx" : 1
10                },
11                "ns" : "de.mail",
12                "name" : "subject_text_body_text",
13                "weights" : {
14                        "body" : 1,
15                        "subject" : 10
16                },
17                "default_language" : "english",
18                "language_override" : "language"
19        }
20]

We created a text index spanning the fields “subject” and “body”, where the first got a weight of 10 and the latter the standard weight 1. Let’s see what impact these weights have:

1> db.mail.insert( {subject: "Robot leader to minions", body: "Humans suck", prio: 0 } )
2> db.mail.insert( {subject: "Human leader to minions", body: "Robots suck", prio: 1 } )
3> db.mail.runCommand("text", {search: "robot"})
4{
5        "queryDebugString" : "robot||||||",
6        "language" : "english",
7        "results" : [
8                {
9                        "score" : 6.666666666666666,
10                        "obj" : {
11                                "_id" : ObjectId("50ed1be71e27d5e73741fafe"),
12                                "subject" : "Robot leader to minions",
13                                "body" : "Humans suck"
14                                "prio" : 0 
15                        }
16                },
17                {
18                        "score" : 0.75,
19                        "obj" : {
20                                "_id" : ObjectId("50ed1bfd1e27d5e73741faff"),
21                                "subject" : "Human leader to minions",
22                                "body" : "Robots suck"
23                                "prio" : 1
24                        }
25                }
26        ],
27        "stats" : {
28                "nscanned" : 2,
29                "nscannedObjects" : 0,
30                "n" : 2,
31                "timeMicros" : 148
32        },
33        "ok" : 1
34}

The document with “robot” in the “subject” field has much higher score because the weight of 10 is a taken as a multiplier.

Filtering and Projection

You can apply additional search criteria via filtering:

1> db.mail.runCommand("text", {search: "robot", filter: {prio:0} } )
2{
3        "queryDebugString" : "robot||||||",
4        "language" : "english",
5        "results" : [
6                {
7                        "score" : 6.666666666666666,
8                        "obj" : {
9                                "_id" : ObjectId("50ed22621e27d5e73741fb04"),
10                                "subject" : "Robot leader to minions",
11                                "body" : "Humans suck",
12                                "prio" : 0
13                        }
14                }
15        ],
16        "stats" : {
17                "nscanned" : 2,
18                "nscannedObjects" : 2,
19                "n" : 1,
20                "timeMicros" : 185
21        },
22        "ok" : 1
23}

Please note that filtering does not use an index.

If you are interested only in a subset of fields, you can use projection (similar to the aggreation framework):

1> db.mail.runCommand("text", {search: "robot", project: {_id:0, prio:0} } )
2{
3        "queryDebugString" : "robot||||||",
4        "language" : "english",
5        "results" : [
6                {
7                        "score" : 6.666666666666666,
8                        "obj" : {
9                                "subject" : "Robot leader to minions",
10                                "body" : "Humans suck"
11                        }
12                },
13                {
14                        "score" : 0.75,
15                        "obj" : {
16                                "subject" : "Human leader to minions",
17                                "body" : "Robots suck"
18                        }
19                }
20        ],
21        "stats" : {
22                "nscanned" : 2,
23                "nscannedObjects" : 0,
24                "n" : 2,
25                "timeMicros" : 127
26        },
27        "ok" : 1
28}

Filtering and projection can be combined, of course.

Examples

All examples can be found on github . Try them yourself.

Summary

With this second part on MongoDB text search we had a look at the more intereting features of the text search capability. For a start that’s quite a good toolbox to implement your own search engines. I’m looking forward your feedback.

Was this post helpful?

Likes

Blog author

Tobias Trelle

Software Architect

Do you still have questions? Just send me a message.

fromTobias Trelle

ctop – manage and monitor your Docker containers

In this post, I’d like to introduce you to a nice command line tool called ctop. I discovered it when I was looking for a tool to monitor some Docker containers for a MongoDB replica set on my local machine while running some load tests. ctop is basically...

Container

17.12.2018 | 1 Minuten Lesezeit

Tobias Trelle

Leaflet und GeoJSON-Daten

Heute zeige ich euch, wie man mittels der JavaScript-Bibliothek Leaflet GeoJSON -Daten auf einer Karte in eigenen Anwendungen darstellen kann. Wie man dies mittels des Google Maps JavaScript API macht, habe ich in diesem Beitrag erklärt . Wir werden ...

Softwareentwicklung
JavaScript

11.6.2018 | 3 Minuten Lesezeit

Tobias Trelle

Google Cloud Function for Machine Learning

In this post I’ll show you how to use a Google Cloud Function to access the machine learning API for natural language processing . Cloud functions are one of the serverless features of the GCP. Please keep in mind that serverless does not mean that your...

Cloud
Google Cloud
Machine Learning

21.5.2018 | 5 Minuten Lesezeit

Tobias Trelle

Google Cloud Natural Language API

In this article I’d like to give you a short introduction to a subset of Google’s machine learning capabilities: the natural language API. This API processes text snippets and can apply several analysis algorithms: analyze-entities: detects entities ...

Cloud
Google Cloud
Machine Learning

6.5.2018 | 4 Minuten Lesezeit

Tobias Trelle

Google Maps API und GeoJSON-Daten

Heute zeige ich euch, wie man GeoJSON-Daten in eigenen Anwendungen in Zusammenhang mit Google Maps anzeigen kann. In meinem GeoJSON-Tutorial hatte ich kurz angesprochen, wie man GeoJSON-Daten mit Drittanbieter-Diensten darstellen kann. Zur Einbettung...

Softwareentwicklung
JavaScript
Google

15.4.2018 | 3 Minuten Lesezeit

Tobias Trelle

RESTful Microservices on the Google Cloud Platform

This tutorial shows you how to develop a RESTful microservice running on the Google Cloud Platform. I already explained how to deploy Spring Boot applications to the AppEngine and how to set up a MongoDB replica set in the Compute Engine . Today you...

Cloud
Google Cloud
Microservices
API
Spring

8.4.2018 | 3 Minuten Lesezeit

Tobias Trelle

GeoJSON Tutorial

In meinem Artikel über die Identifizierung potentieller EX-Raid Arenen in Pokémon GO habe ich das Thema GeoJSON nur kurz als Exkurs erwähnt. Heute möchte ich etwas detaillierter in dieses Thema einsteigen. GeoJSON Spezifikation Was genau sind denn überhaupt...

Data
Softwareentwicklung
JavaScript

19.3.2018 | 4 Minuten Lesezeit

Tobias Trelle

Cloud Launcher for MongoDB in the Google Compute Engine

In this post you will learn how to use Google’s Cloud Launcher to set up instances for a MongoDB replica set in the Google Compute Engine. Replication in MongoDB A minimal MongoDB replica set consists of two data bearing nodes and one so-called arbiter...

Cloud
Infrastructure as Code
Google
NoSQL

5.3.2018 | 3 Minuten Lesezeit

Tobias Trelle

Deploying Spring Boot Applications in the Google AppEngine Flex Environment

In this tutorial I will show how to set up a deployment of Spring Boot applications for the AppEngine Flex environment in the Google Cloud infrastructure. Prerequisites You should be familiar with the Spring Boot ecosystem and should be able to use Maven...

Software development
Cloud
Google
Google Cloud
Spring

13.2.2018 | 2 Minuten Lesezeit

Tobias Trelle

EX-Raid-Arenen in Pokémon GO identifizieren

Heute betreiben wir ein wenig Data Mining mit Geo-Daten, um herauszufinden, wie man potentielle EX-Raid Arenen im Augmented Reality -Spiel Pokémon GO identifizieren kann. Pokémon GO Basics In Pokémon GO geht es darum, möglichst viele der kleinen Pokémon...

Data
JavaScript
AR/VR

5.2.2018 | 5 Minuten Lesezeit

Tobias Trelle

Change Streams in MongoDB 3.6

MongoDB 3.6 introduces an interesting API enhancement called change streams. With change streams you can watch for changes to certain collections by means of the driver API. This feature replaces all the custom oplog watcher implementations out there...

Change Management
NoSQL

15.1.2018 | 2 Minuten Lesezeit

Tobias Trelle

Spring Cloud Service Discovery with Dynamic Metadata

Spring Cloud Service Discovery If you are running applications consisting of a lot of microservices depending on each other, you are probably using some kind of service registry. Spring Cloud offers a set of starters for interacting with the most common...

Cloud
Software architecture
Spring

8.1.2018 | 2 Minuten Lesezeit

Tobias Trelle

Lego WeDo 2.0 Programmierung

Den Lego WeDo 2.0 Bausatz habe ich in bereits in einem ersten Post vorgestellt . Im heutigen Beitrag möchte ich genauer auf dessen Programmierung eingehen. Meet Milo Zunächst muss aber erst mal Hardware her. Der Baukasten enthält (zum Glück, wie ich ...

Softwareentwicklung
Testing

18.10.2017 | 5 Minuten Lesezeit

Tobias Trelle

JUnit 5 – Des Kaisers neue Kleider

JUnit 5 ist im September 2017 in der ersten stabilen Version erschienen. In diesem Post möchte ich Euch die wichtigsten neuen Features vorstellen. Dabei gehe ich davon aus, dass der geneigte Leser mit JUnit 4 halbwegs vertraut ist und Vergleiche dann...

Java
Testing

1.10.2017 | 7 Minuten Lesezeit

Tobias Trelle

Unboxing Lego WeDo 2.0 Roboter Bausatz

In diesem und weiteren Posts möchte ich Euch das Lego WeDo 2.0 Set (45300) vorstellen. Es gehört zur Lego Education Linie und hat Kinder im Grundschulalter als Zielgruppe (und natürlich auch die zugehörigen AFOL s). Das Set wird in einem robusten stabelbaren...

Softwareentwicklung
Testing
Künstliche Intelligenz

27.9.2017 | 2 Minuten Lesezeit

Tobias Trelle

Graphen-Visualisierung mit Neo4j

In diesem Artikel möchte ich nach einer kurzen Einführung in die Graphen-Theorie einen Überblick über die NoSQL-Datenbank Neo4j geben. Insbesondere werde ich auf die Möglichkeiten eingehen, die Neo4j bei der Visualisierung von Graphen anbietet. Was ist...

Datenbank
NoSQL

18.6.2017 | 9 Minuten Lesezeit

Tobias Trelle

In love with Ada

Anyone out there remembering the Ada programming language? In this blog post, I’m going to give you a short introduction to Ada, the history of its name and some of the current occurrences in pop culture. Hello World in Ada To compile our first Ada program...

Software development
Raspberry Pi

10.4.2016 | 3 Minuten Lesezeit

Tobias Trelle

Joins and Schema Validation in MongoDB 3.2

Version 3.2 of the NoSQL database MongoDB introduces two new interesting features (amongst others) that I’d like to explore in this blog post. Joins The logical namespaces where documents are stored are called collections in MongoDB. Up to now every...

NoSQL
Big Data
Validation

7.12.2015 | 3 Minuten Lesezeit

Tobias Trelle

MongoDB-Einführung bei der Java-Usergruppe ruhrjug

Die Java-Enthusiasten im Ruhrgebiet treffen sich regelmäßig bei der ruhrjug , um sich über aktuelle Themen rund um die Programmiersprache Java auszutauschen. Beim letzten Treffen vor der Sommerpause am 25.06.2015 war ich eingeladen, um dort einen Vortrag...

Java
NoSQL
Community
Spring

1.7.2015 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB 2.8 – Neue Storage-Engine WiredTiger

Mit Version 2.8 kommen wesentliche Neuerungen auf die Benutzer der NoSQL-Datenbank MongoDB zu. Eine davon ist die Einführung einer weiteren Storage Engine. Was es damit auf sich hat, werde ich in diesem Artikel erläutern. Bis zur Version 2.6 hat MongoDB...

Big Data
NoSQL

10.12.2014 | 4 Minuten Lesezeit

Tobias Trelle

MongoDB – Riesige Datenmengen schemafrei verwalten

MongoDB ist eine dokumentenorientierte NoSQL-Datenbank, die sich steigender Beliebtheit erfreut. In meinem Artikel MongoDB – Riesige Datenmengen schemafrei verwalten aus dem Java Magazin 5.14 gebe ich eine allgemeine kurze Einführung und erläutere die...

Datenbank
NoSQL

10.7.2014 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB World 2014

For the very first time, the MongoDB community from all over the world gathered in one place. The MongoDB World conference 2014 took place in New York City from June 23rd to 25th. Talks The talks were separated into three topics: dev, ops & buisness...

Big Data
NoSQL
Community

6.7.2014 | 2 Minuten Lesezeit

Tobias Trelle

Test Automation for NoSQL Databases with NoSQL Unit and Travis-CI

Today I want to give you a short summary of my NoSQL matters talk on test automation for NoSQL databases . I basically introduce two tools that may help you with writing unit and integration tests for NoSQL databases: NoSQLUNit is a JUnit extension...

NoSQL
Testing
CI/CD

7.5.2014 | 1 Minuten Lesezeit

Tobias Trelle

Near-Realtime Analytics with MongoDB, Node.js & SmoothieCharts

In this blog post we’ll have a look at how easy it is to do some (near-)realtime analytics with your (big) data. I will use some well-known technologies like MongoDB and node.js and a lesser known JavaScript library called Smoothies Charts for realtime...

Big Data
Node.js

21.1.2014 | 4 Minuten Lesezeit

Tobias Trelle

MongoDB and Ruby

#MongoDB #Ruby I gave a lightning talk on the Ruby driver for MongoDB at the Cloud Developer Camp in Düsseldorf on last Saturday. Here are the slides: Click on the button to load the content from www.slideshare.net. Load content

NoSQL
Ruby

18.7.2013 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB 2.4 Introduces Geospatial Indexing and Search for GeoJSON Geometries...

In case you are unfamiliar with the geospatial stuff, have a look at this introduction to geospatial indexing and searching with MongoDB . In version 2.4 MongoDB introduces support for a subset of GeoJSON geometries. These geometries can be used both...

JavaScript
Big Data
NoSQL

6.3.2013 | 3 Minuten Lesezeit

Tobias Trelle

OOP 2013: Praktische Einführung in MongoDB

Auf der OOP 2013 gab es von mir einen Vortrag zum Thema „Praktische Einführung in MongoDB“ Klicken Sie auf den unteren Button, um den Inhalt von de.slideshare.net zu laden. Inhalt laden Wer wollte, konnte sich MongoDB herunterladen und die Beispiele...

NoSQL
Community

1.2.2013 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB Text Search Explained

The upcoming release 2.4 of MongoDB will include a first, experimental support for full text search (FTS). This feature was requested early in the history of MongoDB as you can see from this JIRA ticket: SERVER-380 . FTS is first available with the ...

NoSQL
Search
NLP

7.1.2013 | 5 Minuten Lesezeit

Tobias Trelle

Spring Batch and MongoDB

#springbatch #mongodb #nosql Spring Batch Spring Batch is a Spring-based framework for enterprise Java batch processing. An important aspect of Spring Batch is the separation between reading from and writing to resources and the processing of a single...

30.11.2012 | 5 Minuten Lesezeit

Tobias Trelle

Oliver Gierke on Spring Data and all the REST …

Today something completely different: I’ll interview Oliver Gierke from SpringSource . He we go … Tobias Trelle: Hi Oliver. Would you mind introducing yourself to listeners that might not already know you. Oliver Gierke: My name is Oliver Gierke. I ...

Data
Java
Community
Database
NoSQL
Spring

20.11.2012 | 10 Minuten Lesezeit

Tobias Trelle

Pessimistic Locking with MongoDB

In this article, I’m going to sketch a pattern for implementing pessimistic locking with MongoDB . MongoDB is a document-orientated NoSQL datastore that does not support locking itself. In some business processes it may be required that you have an ...

23.10.2012 | 3 Minuten Lesezeit

Tobias Trelle

GridFS Support in Spring Data MongoDB

MongoDB MongoDB is a highly scalable, document oriented NoSQL datastore from 10gen. For more information have a look at the MongoDB homepage: http://www.mongodb.org . A short introduction to MongoDB can be found at this blog post . GridFS In MongoDB ...

Cloud
Java
Infrastructure
NoSQL
Spring

26.7.2012 | 2 Minuten Lesezeit

Tobias Trelle

MonjaDB – A MongoDB GUI Client Tool

5.6.2012 | 1 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 6: Redis

Redis Redis [1] is a NoSQL [2] key/value datastore. Think of it as a big, very fast persistent hashmap. Redis offers a master/slave data replication [3] and also a built-in publish/subscribe messaging system [4]. It is implemented in C and can be built...

Java
Cloud
NoSQL
Spring

26.4.2012 | 4 Minuten Lesezeit

Tobias Trelle

MongoDB User-Gruppe Düsseldorf

MongoDB MongoDB ist eine hochskalierbare, Dokumenten-orientierte NoSQL -Datenbank des Herstellers 10gen. Mehr Details finden Sie auf der MongoDB-Homepage: http://www.mongodb.org . Eine kurze Einleitung, die die ersten Schritte mit MongoDB erklärt, findet...

Cloud
NoSQL

22.4.2012 | 1 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 4: Geospatial Queries with MongoDB

Introduction Every location-based service [1 ] has to solve the following problem: find all venues within a given distance from the current location of the user. Long before the advent of mobile devices, geographic information systems (GIS) [2 ] had ...

Cloud
NoSQL
Spring

15.3.2012 | 6 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 5: Neo4j

Introduction Neo4j [1 ] is a high-performance NoSQL [2 ] datastore specialized in persisting graphs. A graph [3 ] is data structure consisting of finite sets of vertices and edges, where an edge is a connection between two vertices. Graphs are used to...

Software architecture
Java
Cloud
NoSQL
Spring

27.2.2012 | 4 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 3: MongoDB

In this part of my blog series I’m going to show how easy it is to access a MongoDB datastore with Spring Data MongoDB. MongoDB MongoDB is a so called NoSQL datastore for document-oriented storage. A good place to start with MongoDB is the Developer...

Cloud
NoSQL
Spring

1.2.2012 | 5 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 2: JPA

What happened before? Part 1: Spring Data Commons Part 2: Spring Data JPA After looking at the Spring Data Commons project in the first part of this blog series, today I’m going to introduce you to the sub project Spring Data JPA . JPA Being a part ...

Java
Software development
Spring

21.1.2012 | 3 Minuten Lesezeit

Tobias Trelle

Spring Data – Part 1: Commons

One goal of the Spring Data project is to provide a common API for accessing both NoSQL datastores and relational databases. Spring Data serves as an umbrella project which offers general solutions – like pagination in large result sets – and consists...

Spring

21.12.2011 | 2 Minuten Lesezeit

Tobias Trelle

Testing and Mocking of Static Methods in Java

Again and again I stumble upon the myth that static code is evil because it is hard to test and you can’t mock it. Architects and lead developers are telling that tale and the juniors are picking it up and repeating it: “Static code is evil. It is hard...

BDD
Java
Testing
Software development
Test Driven Development

10.11.2011 | 4 Minuten Lesezeit

Tobias Trelle

Cloud Computing Basics: the CAP Theorem

Almost unlimited scalability is an essential facet of cloud computing as it is offered by the Google App Engine or CloudFoundry. Insuring this feature leads to a trade-off with other nonfunctional aspects from enterprise computing like consistency. But...

Database
Cloud

28.8.2011 | 4 Minuten Lesezeit

Tobias Trelle

Documenting Custom Robot Framework Keyword Libraries

Right now, I’m introducing the robot framework for automated web tests for one of our customers. Beside the basic robot framework, we are using the SeleniumLibrary and RIDE . This tool stack is going to be rolled out to all software development teams...

Testing

14.8.2011 | 2 Minuten Lesezeit

Tobias Trelle

Quo vadis VMware? vFabric vs. Cloud Foundry

Introduction We will start with an introdcution of VMware’s cloud solutions vFabric and Cloud Foundry. After that, the further evolution of these PaaS platforms will be discussed. vFabric VMware offers his PaaS cloud solution vFabric Cloud Application...

Spring
Cloud

6.6.2011 | 3 Minuten Lesezeit

Tobias Trelle

AMQP Messaging mit RabbitMQ und Spring

RabbitMQ ist als Messaging-System Teil der vFabric Cloud Application Platform. Die Unterstützung des performanten Messaging Protokolls AMQP prädestiniert RabbitMQ für den Einsatz in Hochverfügbarkeitsszenarien. RabbitMQ ist ein Open-Source-Produkt ...

Cloud
Java
Softwareentwicklung
Messaging
Spring

20.4.2011 | 4 Minuten Lesezeit

Tobias Trelle

WebSphereMQ Integration using Mule ESB Community Edition

Mule ESB is an open source implementation of an enterprise service bus. In contrast to the free Community Edition, Mule’s commercial Enterprise Edition provides integration of WebSphereMQ servers out of the box. This article explains how to integrate...

Software architecture
Integration

11.3.2011 | 1 Minuten Lesezeit

Tobias Trelle

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

kibconfig – Wartungstool für Kibana Dashboards

Als wir vor 2 Jahren zu Beginn unseres Projekts damit begannen, unser ELK Logging über Kibana Dashboards zu optimieren, standen wir vor einem Problem: Wie konnten wir unsere für die PP-Umgebung vorbereiteten Dashboards, Visualisierungen und gespeicherten...

NoSQL
APM

12.10.2017 | 3 Minuten Lesezeit

Carsten Rohrbach

Graphen-Visualisierung mit Neo4j

Datenbank
NoSQL

18.6.2017 | 10 Minuten Lesezeit

Tobias Trelle

Elasticsearch: _type-Mapping zur Dateninspektion

ProblemsituationEine typische Situation: Daten aus einer Domän mit verschiedenen Sub-Domänen liegen in stark unterschiedlicher und unbekannter Form, mit ebenso unterschiedlichen und unbekannten Werten, vor. Sich mit diesen Daten auseinanderzusetzen ist...

NoSQL

5.12.2016 | 3 Minuten Lesezeit

Christian Börner-Schulte

Spring Boot & Apache CXF – Logging & Monitoring mit Logback, Elasticsearch...

SOAP-Endpoints auf Basis von Microservice-Technologien mit Spring Boot? Cool! Aber wie findet man bei den ganzen „Micro-Servern“ Fehler? Wie sehen die SOAP-Nachrichten aus und wie logge ich eigentlich generell? Und: wie viele Produkte haben wir eigentlich...

Frontend
NoSQL
Java
APM
Logging
Spring

26.7.2016 | 24 Minuten Lesezeit

Jonas Hecht

IoT-Analyse-Plattform

Internet of Things (IoT) oder auch Industrie 4.0 ist heute in aller Munde. Aber welche Herausforderungen stellen sich eigentlich bei der Verarbeitung großer Datenmengen? Eine Variante kann sein, Daten zu sammeln und später im Batch-Betrieb zu verarbeiten...

Cloud
IoT
NoSQL
Scala
Big Data

13.7.2016 | 14 Minuten Lesezeit

Achim Nierbeck

Elixir, Phoenix und CouchDB – Eine Einführung

Das Elixir MVC Framework PhoenixVon Markus Krogemann und Marcel WolfWorum geht es?Zunächst wird gezeigt, wie sich eine Webanwendung mit Phoenix in einfachen Schritten erstellen lässt, ohne dass ein tieferes Verständnis für eine funktionale Programmiersprache...

Softwareentwicklung
Functional programming
Frontend
NoSQL

13.1.2016 | 4 Minuten Lesezeit

Marcel Wolf

Joins und Schema-Validierung mit MongoDB 3.2

Mit Version 3.2 der dokumentenorientierten NoSQL-Datenbank MongoDB werden u.a. zwei lange vermisste(?) Features eingeführt, auf die ich im Folgenden näher eingehen möchte.JoinsDie logischen Namensräume, in denen man seine Dokumente ablegt, werden in...

NoSQL
Big Data
Validierung

7.12.2015 | 3 Minuten Lesezeit

Tobias Trelle

MongoDB-Einführung bei der Java-Usergruppe ruhrjug

Die Java-Enthusiasten im Ruhrgebiet treffen sich regelmäßig bei der ruhrjug , um sich über aktuelle Themen rund um die Programmiersprache Java auszutauschen.Beim letzten Treffen vor der Sommerpause am 25.06.2015 war ich eingeladen, um dort einen Vortrag...

Java
NoSQL
Community
Spring

1.7.2015 | 1 Minuten Lesezeit

Tobias Trelle

Cascaded Builder Pattern in Java

Wenn man mit dem Builder Pattern arbeitet, gelangt man an den Punkt, an dem man komplexe Objekte aufbauen muss. Nehmen wir nun an, dass wir ein Auto erzeugen möchten. Dieses besteht aus den Attributen Motor, Maschine und einer Anzahl Räder. Hierfür verwenden...

Java
Search

22.4.2015 | 6 Minuten Lesezeit

Sven Ruppert

Confess – Konferenzbericht

Von 14.-16.04.2015 fand die Confess, eine Konferenz für Enterprise Software Lösungen, statt. Sie wurde im C3 Convention Center in Wien veranstaltet. Auf der Konferenz waren hervorragende Speaker, wie Anton Arhipov, Maarten Mulders und Michael Plöd.Anton...

Community
Softwareentwicklung
NoSQL
Open Source
Java
Kubernetes
Microservices

21.4.2015 | 2 Minuten Lesezeit

Bernd Zuther

DataStax Tech-Day, die Zweite!

Vier Monate sind vergangen, seit wir den ersten Tech-Day gemeinsam mit unserem Partner DataStax in München durchgeführt hatten. Es war also an der Zeit, dieses Format auch in den hohen Norden, genauer gesagt in die Räumlichkeiten der codecentric nach...

NoSQL
Community

31.3.2015 | 2 Minuten Lesezeit

Silvio Tschapke

Big Data und Tiny Hardware – Teil 1

AbstractNachdem Ihr unsere „Big Data in a Box“-Lösung auf Schulungen und Usergroup-Treffen gesehen habt, haben wir immer wieder Anfragen zur Realisierung von Euch erhalten. Ihr wolltet wissen was wir dort gebaut haben und wie alles einzurichten ist. ...

Java
Open Source
Big Data
NoSQL

11.2.2015 | 3 Minuten Lesezeit

Dominique Ronde

MongoDB 2.8 – Neue Storage-Engine WiredTiger

Big Data
NoSQL

10.12.2014 | 4 Minuten Lesezeit

Tobias Trelle

MongoDB – Riesige Datenmengen schemafrei verwalten

Datenbank
NoSQL

10.7.2014 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB Days München 2013

Am 14. Oktober fand in München zum 4. Mal die MongoDB Munich Konferenz statt. Dieses Jahr zog die Veranstaltung mit dem Hilton Hotel am Rosenheimer Platz an einen zentral gelegenen Ort an dem sich laut Veranstalter ca. 240 Anhänger der beliebten OpenSource...

NoSQL

15.10.2013 | 5 Minuten Lesezeit

Bastian Spanneberg

Einführung in Hadoop – Was ist Big Data & Hadoop? (Teil 1 von 3)

Was ist Big Data?„Big Data ist, wenn die Daten selbst Teil des Problems werden“Diese kurze Definition in Anlehnung an ein Zitat des Verantwortlichen für Marktforschung bei O’Reilly Media, Roger Magoulas, ist in meinen Augen die beste Charakterisierung...

Big Data
NoSQL

12.8.2013 | 5 Minuten Lesezeit

Uwe Printz

MongoDB und Ruby

#MongoDB #RubyAm vergangenen Samstag habe ich auf dem Cloud Developer Camp in Düsseldorf einen Vortrag über den Ruby-Treiber für MongoDB gehalten. Hier sind die Slides dazu:Klicken Sie auf den unteren Button, um den Inhalt von www.slideshare.net zu...

NoSQL
Ruby

18.7.2013 | 1 Minuten Lesezeit

Tobias Trelle

MongoDB für den Roboter

Wir setzen das Robot Framework seit geraumer Zeit für automatisierte Softwaretests in unseren Projekten ein. Außerdem beschäftigen sich ein paar meiner Kollegen mit der NoSql Datenbank MongoDB (Tutorial über MongoDB ). Die Dokumenten-Management-Lösung...

Agilität
Big Data
Open Source
NoSQL
Testing

6.6.2013 | 2 Minuten Lesezeit

Max Hartmann

OOP 2013: Praktische Einführung in MongoDB

Auf der OOP 2013 gab es von mir einen Vortrag zum Thema„Praktische Einführung in MongoDB“Klicken Sie auf den unteren Button, um den Inhalt von de.slideshare.net zu laden.Inhalt laden Wer wollte, konnte sich MongoDB herunterladen und die Beispiele live...

NoSQL
Community

1.2.2013 | 1 Minuten Lesezeit

Tobias Trelle

Oliver Gierke über Spring Data und den ganzen REST …

Heute mal was ganz anderes: ich führe ein Interview mit Oliver Gierke von SpringSource . Los geht’s …Tobias Trelle: Hallo Oliver. Möglicherweise gibt es Leser, die Dich noch nicht kennen. Könntest Du Dich bitte kurz vorstellen?Oliver Gierke: Mein Name...

Data
Java
Community
Datenbank
NoSQL
Spring

20.11.2012 | 9 Minuten Lesezeit

Tobias Trelle

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Du stehst vor einer großen IT-Herausforderung? Wir sorgen für eine maßgeschneiderte Unterstützung. Informiere dich jetzt.

Hilf uns, noch besser zu werden.

Wir sind immer auf der Suche nach neuen Talenten. Auch für dich ist die passende Stelle dabei.

Contact

Send

MongoDB Text Search Tutorial

API

Text Query Syntax

Negation

Phrase Search

Multi Language Support

Multiple Fields

Filtering and Projection

Examples

Summary

Was this post helpful?

Ja

Blog author

Get in contact

Get in contact

More articles

ctop – manage and monitor your Docker containers

Leaflet und GeoJSON-Daten

Google Cloud Function for Machine Learning

Google Cloud Natural Language API

Google Maps API und GeoJSON-Daten

RESTful Microservices on the Google Cloud Platform

GeoJSON Tutorial

Cloud Launcher for MongoDB in the Google Compute Engine

Deploying Spring Boot Applications in the Google AppEngine Flex Environment

EX-Raid-Arenen in Pokémon GO identifizieren

Change Streams in MongoDB 3.6

Spring Cloud Service Discovery with Dynamic Metadata

Lego WeDo 2.0 Programmierung

JUnit 5 – Des Kaisers neue Kleider

Unboxing Lego WeDo 2.0 Roboter Bausatz

Graphen-Visualisierung mit Neo4j

In love with Ada

Joins and Schema Validation in MongoDB 3.2

MongoDB-Einführung bei der Java-Usergruppe ruhrjug

MongoDB 2.8 – Neue Storage-Engine WiredTiger

MongoDB – Riesige Datenmengen schemafrei verwalten

MongoDB World 2014

Test Automation for NoSQL Databases with NoSQL Unit and Travis-CI

Near-Realtime Analytics with MongoDB, Node.js & SmoothieCharts

MongoDB and Ruby

MongoDB 2.4 Introduces Geospatial Indexing and Search for GeoJSON Geometries...

OOP 2013: Praktische Einführung in MongoDB

MongoDB Text Search Explained

Spring Batch and MongoDB

Oliver Gierke on Spring Data and all the REST …

Pessimistic Locking with MongoDB

GridFS Support in Spring Data MongoDB

MonjaDB – A MongoDB GUI Client Tool

Spring Data – Part 6: Redis

MongoDB User-Gruppe Düsseldorf

Spring Data – Part 4: Geospatial Queries with MongoDB

Spring Data – Part 5: Neo4j

Spring Data – Part 3: MongoDB

Spring Data – Part 2: JPA

Spring Data – Part 1: Commons

Testing and Mocking of Static Methods in Java

Cloud Computing Basics: the CAP Theorem

Documenting Custom Robot Framework Keyword Libraries

Quo vadis VMware? vFabric vs. Cloud Foundry

AMQP Messaging mit RabbitMQ und Spring

WebSphereMQ Integration using Mule ESB Community Edition

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

View Job

More articles in this subject area

kibconfig – Wartungstool für Kibana Dashboards

Graphen-Visualisierung mit Neo4j

Elasticsearch: _type-Mapping zur Dateninspektion

Spring Boot & Apache CXF – Logging & Monitoring mit Logback, Elasticsearch...

IoT-Analyse-Plattform

Elixir, Phoenix und CouchDB – Eine Einführung

Joins und Schema-Validierung mit MongoDB 3.2

MongoDB-Einführung bei der Java-Usergruppe ruhrjug

Cascaded Builder Pattern in Java

Confess – Konferenzbericht

DataStax Tech-Day, die Zweite!

Big Data und Tiny Hardware – Teil 1