aboutsummaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAgeFilesLines
...
| * | | | Merge branch 'use-default-replication-factor' into fix-offset-topic-replicationStaffan Olsson2018-02-012-4/+0
| |\ \ \ \
| * | | | | Reverts previous commit and uses the documented default insteadStaffan Olsson2018-02-011-2/+1
| | | | | |
| * | | | | Should be larger than or equal min.insync.replicasStaffan Olsson2018-02-011-0/+1
| | | | | |
* | | | | | Merge pull request #95 from Yolean/ops-jobssolsson2018-02-035-0/+217
|\ \ \ \ \ \ | | | | | | | | | | | | | | Add Jobs and tests for common maintenance operations
| * | | | | | Text fixops-jobsStaffan Olsson2018-02-021-2/+2
| | | | | | |
| * | | | | | Merge branch 'increase-replication-factor' into ops-jobsStaffan Olsson2018-02-022-0/+75
| |\ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | motivated by https://github.com/Yolean/kubernetes-kafka/pull/140
| | * | | | | | Explains why this operation is hereStaffan Olsson2018-02-021-0/+10
| | | | | | | |
| | * | | | | | Clarifies output a bit moreStaffan Olsson2018-02-021-1/+2
| | | | | | | |
| | * | | | | | Got weird overlapping outputStaffan Olsson2018-02-021-0/+1
| | | | | | | |
| | * | | | | | Adds the necessary json mod to require more replicas,Staffan Olsson2018-02-021-5/+17
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | which seems to be equivalent to setting replication.factor 3 at topic create.
| | * | | | | | Starts from the reassign-partitions jobStaffan Olsson2018-02-021-0/+51
| |/ / / / / /
| * | | | | | The first two tasks under Partition Management in the Kafka bookStaffan Olsson2017-11-101-0/+22
| | | | | | |
| * | | | | | Adds a job that can actually fix under-replication ...Staffan Olsson2017-11-101-0/+51
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | as long as you have one up-to-date replica. The use case is broker outages that are longer than mere re-scheduling, such as zone outage in a multi-zone cluster.
| * | | | | | But note that this job won't actually trigger replication ...Staffan Olsson2017-11-101-0/+0
| | | | | | | | | | | | | | | | | | | | | | | | | | | | of under-replicated topics if a broker is down
| * | | | | | Fixes container log to show unreplicated partitions (if there are any)Staffan Olsson2017-11-101-3/+9
| | | | | | |
| * | | | | | Adds job to simplify replica electionStaffan Olsson2017-11-092-4/+21
| | | | | | |
| * | | | | | Adds test that goes unready when there's not enough replicas of >0 topicsStaffan Olsson2017-11-091-0/+46
| | | | | | |
* | | | | | | Merge pull request #128 from Yolean/metrics-jmx-for-brokerssolsson2018-02-032-0/+87
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | Add Kafka Prometheus metrics export
| * | | | | | | There's only values in this metric typemetrics-jmx-for-brokersStaffan Olsson2018-02-031-1/+1
| | | | | | | |
| * | | | | | | At first i suspected that order matters, but this also worksStaffan Olsson2018-02-031-2/+2
| | | | | | | |
| * | | | | | | ”In a production Kafka cluster, an offline partitionStaffan Olsson2018-02-031-1/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | may be impacting the producer clients, losing messages or causing back-pressure in the application. This is most often a “site down” type of problem and will need to be addressed immediately.” Excerpt from: Neha Narkhede, Gwen Shapira, and Todd Palino. ”Kafka: The Definitive Guide”. We now export kafka_controller_kafkacontroller_value{name="OfflinePartitionsCount",} and friends. See #140 for why.
| * | | | | | | It was the liveness probe that killed the metrics container,Staffan Olsson2018-01-291-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | and we might not need liveness if we have alerts for stale metrics. This reverts commit f1e6e96231465775ffa0ded5b745774182fec5ff.
| * | | | | | | Removes liveness probes, to focus on memory limits,Staffan Olsson2018-01-191-5/+0
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | and rely on metric staleness alerts instead for exporter liveness. This reverts commit 74a5177270301f3cbe0342657d8bcf70d1ae76ed.
| * | | | | | | With 120M you get OOMKilled even on brokers with modest metrics volumesStaffan Olsson2018-01-191-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This reverts commit 66255af447373f9b14e338a4b819dddad483a257.
| * | | | | | | With kubectl patch, this folder kan be scoped as Prometheus supportStaffan Olsson2018-01-192-1/+1
| | | | | | | |
| * | | | | | | On start the metrics pod gesses that it has ~45%Staffan Olsson2018-01-191-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | of the memory limit as "Max. Heap Size (Estimated)" Reducing limits as experiment for #112.
| * | | | | | | Evaluates JVM memory limit awarenessStaffan Olsson2018-01-191-2/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Interesting input for #112, for use with broker and zk pods in addition to KAFKA_HEAP_OPTS.
| * | | | | | | This is an optional feature, but belongs to the broker pods,Staffan Olsson2018-01-191-0/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | so let's evaluate kubectl patch to add it
| * | | | | | | Adds liveness probe from the metrics-improve-scrape-times branchStaffan Olsson2018-01-191-0/+5
| | | | | | | |
| * | | | | | | Upgrades jmx exporter to 0.2.0Staffan Olsson2018-01-191-1/+1
| | | | | | | |
| * | | | | | | The metrics part of #49Staffan Olsson2018-01-191-0/+38
| | | | | | | |
| * | | | | | | Current config from the metrics-improve-scrape-times branch,Staffan Olsson2018-01-191-0/+44
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | see https://github.com/Yolean/kubernetes-kafka/pull/49
* | | | | | | | Merge pull request #135 from Yolean/use-default-replication-factorsolsson2018-02-012-4/+0
|\ \ \ \ \ \ \ \ | |_|_|/ / / / / |/| | | / / / / | | |_|/ / / / | |/| | | | | Update test topic replication factor
| * | | | | | Uses brokers' default replication factor for explicit topic creationuse-default-replication-factorStaffan Olsson2018-01-222-4/+0
| |/ / / / /
* | | | | | Merge pull request #138 from deadmoose/patch-1solsson2018-02-011-1/+1
|\ \ \ \ \ \ | | | | | | | | | | | | | | Speeling
| * | | | | | SpeelingDavid Hoover2018-01-311-1/+1
|/ / / / / /
* | | | | | Merge pull request #125 from Yolean/linkedin-burrowsolsson2018-01-313-0/+114
|\ \ \ \ \ \ | | | | | | | | | | | | | | Add Burrow for consumer lag monitoring
| * | | | | | Upgrades to current Burrow master, a91cf4dlinkedin-burrowStaffan Olsson2018-01-311-1/+1
| | | | | | |
| * | | | | | Applies the fix for v3 API so now we export five metricsStaffan Olsson2018-01-191-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | https://github.com/jirwin/burrow_exporter/pull/9
| * | | | | | Adds a frontend container. No need to scale it indivudually.Staffan Olsson2018-01-192-4/+29
| | | | | | |
| * | | | | | Lets us see all consumers, for nowStaffan Olsson2018-01-191-2/+2
| | | | | | |
| * | | | | | Adds a service. Ordering is ok with plain file names here.Staffan Olsson2018-01-193-0/+12
| | | | | | |
| * | | | | | The build from search-and-replace in github.com/jirwin/burrow_exporter/pull/9Staffan Olsson2018-01-171-1/+1
| | | | | | |
| * | | | | | Adds prometheus exporter, but sadly incompatible with current BurrowStaffan Olsson2018-01-171-0/+14
| | | | | | | | | | | | | | | | | | | | | | | | | | | | https://github.com/jirwin/burrow_exporter/issues/8
| * | | | | | fixes container nameStaffan Olsson2018-01-171-1/+1
| | | | | | |
| * | | | | | Lists actual brokersStaffan Olsson2018-01-171-2/+2
| | | | | | |
| * | | | | | Adds linkedin/Burrow for use with port-forward 8000Staffan Olsson2018-01-172-0/+63
| | | | | | |
* | | | | | | Merge pull request #137 from Yolean/outside-host-as-annotationsolsson2018-01-291-1/+5
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | Move outside host info from labels to annotations
| * | | | | | | Moves outside host details to annotations,outside-host-as-annotationStaffan Olsson2018-01-281-1/+5
|/ / / / / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | as DNS names may exceed the 63 char limit on label values and they're not too useful for pod matching in services etc.
* | | | | | | Merge pull request #130 from lenadroid/mastersolsson2018-01-213-1/+19
|\ \ \ \ \ \ \ | |_|/ / / / / |/| | | | | | Adding storage class files for AKS