You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/about-us/history.md
+6-3Lines changed: 6 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,7 +19,10 @@ As of April 2014, Yandex.Metrica was tracking about 12 billion events (page view
19
19
## Usage in Yandex.Metrica and other Yandex services {#usage-in-yandex-metrica-and-other-yandex-services}
20
20
21
21
ClickHouse serves multiple purposes in Yandex.Metrica.
22
-
Its main task is to build reports in online mode using non-aggregated data. It uses a cluster of 374 servers, which store over 20.3 trillion rows in the database. The volume of compressed data is about 2 PB, without accounting for duplicates and replicas. The volume of uncompressed data (in TSV format) would be approximately 17 PB.
22
+
Its main task is to build reports in online mode using non-aggregated data.
23
+
It uses a cluster of 374 servers, which store over 20.3 trillion rows in the database.
24
+
The volume of compressed data is about 2 PB, without accounting for duplicates and replicas.
25
+
The volume of uncompressed data (in TSV format) would be approximately 17 PB.
23
26
24
27
ClickHouse also plays a key role in the following processes:
25
28
@@ -29,13 +32,13 @@ ClickHouse also plays a key role in the following processes:
29
32
- Running queries for debugging the Yandex.Metrica engine.
30
33
- Analyzing logs from the API and the user interface.
31
34
32
-
Nowadays, there are a multiple dozen ClickHouse installations in other Yandex services and departments: search verticals, e-commerce, advertisement, business analytics, mobile development, personal services, and others.
35
+
Nowadays, there are multiple dozen ClickHouse installations in other Yandex services and departments: search verticals, e-commerce, advertisement, business analytics, mobile development, personal services, and others.
33
36
34
37
## Aggregated and non-aggregated data {#aggregated-and-non-aggregated-data}
35
38
36
39
There is a widespread opinion that to calculate statistics effectively, you must aggregate data since this reduces the volume of data.
37
40
38
-
However data aggregation comes with a lot of limitations:
41
+
However, data aggregation comes with a lot of limitations:
39
42
40
43
- You must have a pre-defined list of required reports.
0 commit comments