Commit Graph

79 Commits

Author SHA1 Message Date
Jon
03affbee80 feat(labels): lables moved to values.yaml
!4
2023-09-29 19:08:48 +09:30
Jon
65b61ecf54 feat(graph): add ceph size/used to dashboard
!4
2023-09-29 10:57:28 +09:30
Jon
cf38a4156a refactor(grafana_datasources): default prom set to thanos
!4
2023-09-28 09:09:53 +09:30
Jon
7afc6aa515 feat(prometheus): rule selector configurable from values.yaml
!4
2023-09-28 09:08:26 +09:30
Jon
2371adcbc5 feat(promtail): split node to seperate promtail service monitor
this allows the mixins to work, specifically promtail

!4
2023-09-28 09:07:11 +09:30
Jon
f73a9e462e feat(grafana_agent): add prom rules
!4
2023-09-28 09:05:01 +09:30
Jon
5bb7197129 feat(prometheus): tsdb retention set
!4
2023-09-28 02:06:06 +09:30
Jon
be0161876d feat(thanos): prometheus rules added for sidecar
!4
2023-09-28 02:06:06 +09:30
014156eb05 build(version): bump version 0.0.1 → 0.1.0 0.1.0 2023-09-27 15:36:31 +00:00
Jon
376cf3ae9f Merge branch '1-metrics-long-term-storage' into 'development'
feat(metrics): long term storage injection

Closes #1

See merge request nofusscomputing/projects/kubernetes_monitoring!2
2023-09-27 15:31:08 +00:00
Jon
c76b753618 docs: update
!2
2023-09-28 00:53:48 +09:30
Jon
6dd413a0d4 docs(metrics_storage): added docs on how to configure external metrics routing
!2 closes #1
2023-09-28 00:48:00 +09:30
Jon
ffc0f84669 refactor(dashboard): nfc cluster overview to use source variable
!2
2023-09-28 00:33:43 +09:30
Jon
5add73c411 feat(grafana): DataSources configurable in values.yaml
!2 #1
2023-09-28 00:32:43 +09:30
Jon
239837ecbf feat(prometheus): ability to add additional config settings
settings are pertinent to the kubernetes prometheus object used by the prom operator.

!2 #1
2023-09-28 00:27:49 +09:30
Jon
ac812c2c7b feat(prometheus): thanos sidecar option added
!2 #1
2023-09-28 00:15:40 +09:30
Jon
9b12fcf5bc Merge branch 'genesis' into 'development'
chore: migrate chart to public repo

See merge request nofusscomputing/projects/kubernetes_monitoring!1
2023-09-27 05:24:50 +00:00
Jon
d4e529aec9 docs: add more features
!1
2023-09-27 14:47:19 +09:30
Jon
e7480105f7 feat(grafana_dashboard): nfc custom, cluster overview
!1
2023-09-27 14:39:43 +09:30
Jon
c8ea929873 feat(kyverno): add clusterpolicy role and rolebinding
cluster policy creates the role and rolebindings for prometheuse to monitor the ns

!1
2023-09-26 06:27:20 +09:30
Jon
899c6a3d78 refactor(prometheus): adjusted role/rolbinding manifest to loop
!1
2023-09-26 06:25:19 +09:30
Jon
817b838655 refactor(grafana_agent): clean up config file
!1
2023-09-26 01:30:06 +09:30
Jon
b7cfebf92b fix(kubestateproxy): bump proxy cpu limit
!1
2023-09-25 16:49:18 +09:30
Jon
e3648324f2 feat(grafana_agent): attach hostname to logs and metrics
!1
2023-09-25 16:46:21 +09:30
Jon
a26f887fa4 fix(grafana): use named pvc so it's reused
templated pvc as using a different name al the time. this caused left
over pvc that were unused. name set so they can be reused

!1
2023-09-25 16:42:45 +09:30
Jon
8f53b0fa07 feat(grafana): sidecar for loading dashboards from configmap
!1
2023-09-25 16:42:45 +09:30
Jon
82cd91a859 docs: added missing attribution
!1
2023-09-25 16:42:45 +09:30
Jon
944d615331 fix(ceph): PromRule CephPGImbalance adjusted to group by node
balancing is done by hostname not osd.

!1
2023-09-25 16:42:45 +09:30
Jon
4198ac78da docs: add blurb on values
!1
2023-09-25 16:42:45 +09:30
Jon
3a4ca30e68 feat(servicemonitor): ceph use cluster name for instance
!1
2023-09-25 16:42:45 +09:30
Jon
e2540a955f feat(servicemonitor): for prometheus, use pod name for instance
!1
2023-09-25 16:42:45 +09:30
Jon
89f0feae08 fix(grafana): dont use operator readiness prob
operator defines endpoint :3000/api/health, which fails with invalid argument

!1
2023-09-25 16:42:45 +09:30
Jon
0d1d0a34d8 feat(grafana): add affinity to values
!1
2023-09-25 16:42:45 +09:30
Jon
83fb30b4ed feat(prometheus): add storage to values
!1
2023-09-25 16:42:45 +09:30
Jon
7add74fbb1 refactor: yaml object ordering
!1
2023-09-25 16:42:45 +09:30
Jon
c742eea38d feat(prometheus_adaptor): add affinity to values
!1
2023-09-25 16:42:45 +09:30
Jon
836cc111e0 feat(prometheus): add affinity to values
!1
2023-09-25 16:42:45 +09:30
Jon
d189dfe0ee chore(versions): update image versions alertmanager,grafana
!1
2023-09-25 16:42:45 +09:30
Jon
2331bcbba3 feat(logging): add log files from /var/log
!1
2023-09-25 16:42:45 +09:30
Jon
8b966b0f0a feat(grafana-agent): expand env vars in config
this allows using the hosts env vars in the config file

!1
2023-09-21 18:55:49 +09:30
Jon
3281cd6552 refactor(prometheus): use values ns var
!1
2023-09-21 12:53:34 +09:30
Jon
3f2204cf31 feat(deployment): grafana-agent dnspoly clusterfirst
!1
2023-09-21 01:02:41 +09:30
Jon
cbac640af3 feat(promtail): filter devices to not include temp/virtual
!1
2023-09-21 01:02:00 +09:30
Jon
64d612047f refactor: use loki ns var
!1
2023-09-21 01:01:25 +09:30
Jon
4e742ee2d5 feat(metrics): node scraper set to 5s
!1
2023-09-21 01:00:53 +09:30
Jon
4ced83b90d feat(loik): be able to specify full loki url 2023-09-19 23:20:57 +09:30
Jon
a28126555f feat(loki): add prometheus alerts and rules for mixins
!1
2023-09-19 23:19:45 +09:30
Jon
aba3cb22a9 feat(loki): removed service monitor
a service monitor is included with loki

!1
2023-09-19 23:19:14 +09:30
Jon
63c0381bd1 feat(metrics): Calico service and monitor added
!1
2023-09-19 23:18:27 +09:30
Jon
9e3f33ae56 feat(node): added dashboard
!1
2023-09-19 19:02:32 +09:30