Search engines *

From Bing to Google

tommy113 Jul 18 2023 at 11:11

SEO Tips to Magento 2 Product Pages

3 min

574

Search engines * Product Management * Magento *

From sandbox

Avoid duplicate content

Many online merchants face a common challenge with multiple product variations, leading to duplicate content issues. When similar products have slightly different URLs like "?=sortby" or "?p=2", search engines may view them as duplicates, impacting your website's credibility and search rankings. To tackle this:
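One common way to handle such variants is to map every variant URL to a single canonical address, which can then be declared in a `rel="canonical"` link. The following is a minimal Python sketch of that idea, not Magento's own mechanism; the parameter names in `NON_CANONICAL_PARAMS` and the `shop.example` host are hypothetical and would need to match the parameters your store actually uses.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical set of query parameters that only change presentation
# (paging, sorting), not the content of the page.
NON_CANONICAL_PARAMS = {"p", "sortby", "dir", "limit"}


def canonical_url(url: str) -> str:
    """Return the URL with presentation-only query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in NON_CANONICAL_PARAMS
    ]
    # Drop the fragment as well; it is never sent to the server.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


print(canonical_url("https://shop.example/shirts?p=2&color=red"))
```

Parameters that do change the content (here `color`) are kept, so genuinely different pages still get different canonical addresses.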

eugeniouglov Feb 19 2023 at 16:42

How to increase speed and flexibility of searching files

2 min

609

Search engines *

In a previous article, I described the logic of the project to search for personal information by tagging, but that was for the web version.

Searching for files on a PC is a bit different and I would like to touch on this topic.

vsprog Feb 16 2023 at 22:00

Elasticsearch as NoSQL Database

Easy

13 min

3.4K

Distributed systems * NoSQL * Search engines *

From sandbox

In this article, I will introduce NoSQL concepts and show how they are related to Elasticsearch, and we will consider this search engine as a NoSQL document store.

eugeniouglov Feb 9 2023 at 22:16

How I wrote my search engine to quickly find personal information

6 min

1.7K

Data storaging * Start-up development * Website development * Search engines * Programming *

Opinion

Translation

Search your own data as you would in the Google search engine.

IgKend Oct 21 2022 at 18:42

How Yandex Made Their Biggest Improvement in the Search Engine with the Help of Toloka

5 min

2.4K

Search engines * Data Mining * Machine learning * Artificial Intelligence * Data Engineering *

Tutorial

Toloka is a crowdsourcing platform and microtasking project launched by Yandex to quickly mark up large amounts of data. But how can such a simple concept play a crucial role in improving the work of neural networks?

SergeyBPshenichnikov Jun 8 2022 at 15:38

Algebra of text without formulas

64 min

2.1K

Search engines * Semantics * Algorithms * Natural Language Processing *

Translation

The article is an abstract of my book [1], based on previously presented publications [2], [3], [4], [5].

SergeyBPshenichnikov Jun 7 2022 at 19:41

Collective meaning recognition

37 min

1.6K

Search engines * Semantics * Algorithms * Natural Language Processing *

Translation

The published material is in the Appendix of my book [1]

Modern civilization finds itself at a crossroads where it must choose the meaning of life. Because of the development of technology, the majority of the world's population may become "superfluous", not in demand in the production of value. There is another option, in which each person is a supreme value, an absolute individual, and can be indispensably useful in the technology of the collective mind.

In the eighties of the last century, the task of creating a scientific field of "collective intelligence" was set. Collective intelligence is defined as the ability of the collective to find solutions to problems more effectively than each participant individually. The right collective mind must be...

SergeyBPshenichnikov Dec 1 2021 at 18:06

Concordance of sense

17 min

1.1K

Natural Language Processing * Algorithms * Semantics * Search engines *

Translation

In [1,2,3], texts (sign sequences with repetitions) were transformed (coordinatized) into algebraic systems using matrix units as word images. Coordinatization is a necessary condition for the algebraization of any subject area. The function (arrow) (7) in [1] is a matrix coordinatization of text. One can perform algebraic operations on words and fragments of matrix texts as with integers, but taking into account the noncommutativity of multiplication of words as matrices. The structuring of texts is reduced to the calculation of ideals and categories of texts in matrix form.
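To make the noncommutativity concrete, here is a small Python sketch of matrix units, matrices with a single 1 and zeros elsewhere. It illustrates only the standard rule that the product of two matrix units is nonzero when the inner indices match; how the cited works assign indices to words is not shown in this excerpt, so the indices and the helper names `matrix_unit` and `matmul` are arbitrary illustrations.

```python
def matrix_unit(i, j, n):
    """Return the n x n matrix with a single 1 at row i, column j (0-based)."""
    return [[1 if (r, c) == (i, j) else 0 for c in range(n)] for r in range(n)]


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [
        [sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
        for r in range(n)
    ]


E01 = matrix_unit(0, 1, 3)
E12 = matrix_unit(1, 2, 3)

forward = matmul(E01, E12)   # inner indices match (1 and 1): a matrix unit
backward = matmul(E12, E01)  # inner indices differ (2 and 0): the zero matrix

print(forward == matrix_unit(0, 2, 3))
print(backward == [[0] * 3 for _ in range(3)])
```

The order of the factors matters: one product is again a matrix unit, the other vanishes.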

SergeyBPshenichnikov Apr 23 2021 at 10:01

Context category

12 min

1.5K

Search engines * Semantics * Algorithms * Natural Language Processing *

Translation

The mathematical model of sign sequences with repetitions (texts) is a multiset. The multiset was defined by D. Knuth in 1969 and later studied in detail by A. B. Petrovsky [1]. The universal property of a multiset is the existence of identical elements. The limiting case of a multiset with unit multiplicities of elements is a set. A set with unit multiplicities corresponding to a multiset is called its generating set or domain. A set with zero multiplicity is an empty set.
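As a rough illustration of these definitions, the Python sketch below models a text as a multiset using the standard library's `collections.Counter`. The sample sentence and the helper name `text_multiset` are assumptions made for the example and do not come from the article.

```python
from collections import Counter


def text_multiset(text: str) -> Counter:
    """Return the multiset of words in a text as word -> multiplicity."""
    return Counter(text.lower().split())


words = text_multiset("to be or not to be")

# Generating set (domain): every distinct element, kept exactly once.
domain = set(words)

# The limiting case: the same elements, all with unit multiplicity.
unit = Counter(domain)

print(words["to"], words["or"])
print(sorted(domain))
```

Here `words` records how often each word repeats, while `domain` forgets the multiplicities and keeps only which words occur.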

SergeyBPshenichnikov Apr 14 2021 at 15:13

Algebra of text. Examples

5 min

1.8K

Natural Language Processing * Algorithms * Semantics * Search engines *

Translation

The previous work [1] describes a method of transforming a sign sequence into an algebra, using a linguistic text as an example. Two further examples of the algebraic structuring of texts of a different nature are given here to illustrate the method.

SergeyBPshenichnikov Mar 28 2021 at 16:09

Converting text into algebra

10 min

1.6K

Search engines * Natural Language Processing * Semantics * Algorithms *

Translation

Algebra and language (writing) are two different learning tools. When they are combined, we can expect new methods of machine understanding to emerge. To determine the meaning (to understand) is to calculate how the part relates to the whole. Modern search algorithms already perform the task of meaning recognition, and Google’s tensor processors perform the matrix multiplications (convolutions) necessary in an algebraic approach. At the same time, semantic analysis mainly uses statistical methods. Using statistics in algebra, for instance when looking for divisibility rules for numbers, would simply be strange. The algebraic apparatus is also useful for interpreting calculation results when recognizing the meaning of a text.

p4ymak Mar 5 2020 at 11:37

Ray Cast Visual Search (RCVS). Fast and simple algorithm for searching 3D objects with similar shapes

8 min

3.2K

CGI * Algorithms * Search engines * 3D-graphics *

For me, these two models are quite similar, but in fact they don’t have obvious characteristics by which to measure this similarity. These models have different numbers of vertices, edges and polygons. They are of different sizes, rotated differently and both have the same transforms (Location = [0,0,0], Rotation in radians = [0,0,0], Scale = [1,1,1]). So how can their similarity be determined?

VasylArtiushchenko Jan 14 2020 at 22:39

10 SEO Myths to Leave Behind in 2020

6 min

1.3K

Search engine optimization * Search engines *

To say SEO has “changed a lot” would be the understatement of the decade. We’ll often see multiple updates per year from Google, like the BERT update in October aimed at helping the search engine better interpret natural language searches. Or the site diversity update in June, which focused on reducing duplicate organic listings on SERPs for the same site.

VasylArtiushchenko Dec 12 2019 at 21:58

SEO vs. PPC — What's better for your business?

8 min

2.5K

Search engines * Search engine optimization * Internet marketing * Display advertising * Contextual advertising *

What's better for your business?

At a certain point, any website owner wonders what's better: SEO or PPC? Which promotion strategy will be the most rational to use in this particular situation? Or maybe it's best to combine both?

Before you decide between SEO and PPC, you need to consider the differences between them…

-2

VasylArtiushchenko Aug 2 2019 at 14:34

International SEO | International SEO ranking factors

6 min

1.4K

Internet marketing * Search engine optimization * Search engines *

Let's say your website offers content, products, or services for people from different regions or countries who speak different languages. Search engines will probably count this as duplicate content, leading to low rankings.

+10

stefanbuzz Mar 28 2019 at 06:55

PVS-Studio for Java hits the road. Next stop is Elasticsearch

11 min

2.2K

PVS-Studio corporate blog * Java * Open source * Search engines *

For many years, the PVS-Studio team has kept a blog about checking open-source projects with the static code analyzer of the same name. To date, more than 300 projects have been checked, and the error base contains more than 12000 cases. Initially the analyzer was implemented for checking C and C++ code; support for C# was added later. As a result, the majority of the checked projects (> 80%) are written in C and C++. Quite recently Java was added to the list of supported languages, which means that a whole new world is now open to PVS-Studio, so it's time to complement the base with errors from Java projects.

The Java world is vast and varied, so one doesn't even know where to look first when choosing a project to test the new analyzer. Ultimately, the choice fell on the full-text search and analytics engine Elasticsearch. It is quite a successful project, and finding errors in significant projects is especially satisfying. So, what defects did PVS-Studio for Java manage to detect? The rest of the article covers the results of the check.

+25

ashotog Mar 10 2019 at 07:48

How to Discover MongoDB and Elasticsearch Open Databases

3 min

17K

Database Administration * Information Security * Search engines * System administration *

Some time ago, it was very “fashionable” among security researchers to find improperly configured AWS cloud storage containing various kinds of confidential information. At that time, I even published a small note about how open Amazon S3 cloud storage is discovered.

However, time passes, and the focus of research has shifted to the search for unsecured, publicly exposed databases. More than half of the known cases of large data leaks over the past year are leaks from open databases.

Today we will try to figure out how such databases are discovered by security researchers...

+16