This repository is here to guide you through the GitHub tutorial that goes hand-in-hand with a video available on YouTube and a detailed blog post on my website. Together, these resources are designed to give you a complete understanding of the topic.
Here are the links to the related assets:
- YouTube Video: How to configure Fluent Bit to collect Logs for our K8S cluster
- Blog Post: How to configure Fluent Bit to collect logs for your Kubernetes cluster
Feel free to explore the materials, star the repository, and follow along at your own pace.
This repository showcases the usage of Loki by using GKE with the HipsterShop.
The following tools need to be installed on your machine :
- jq
- kubectl
- git
- gcloud (if you're using GKE)
- Helm
PROJECT_ID="<your-project-id>"
gcloud services enable container.googleapis.com --project ${PROJECT_ID}
gcloud services enable monitoring.googleapis.com \
cloudtrace.googleapis.com \
clouddebugger.googleapis.com \
cloudprofiler.googleapis.com \
--project ${PROJECT_ID}
ZONE=us-central1-b
gcloud containr clusters create isitobservable \
--project=${PROJECT_ID} --zone=${ZONE} \
--machine-type=e2-standard-2 --num-nodes=4
git clone https://github.com/isItObservable/Episode3--Kubernetes-Fluentbit.git
cd Episode3--Kubernetes-Fluentbit
cd hipstershop
./setup.sh
Prometheus (as already done during Episode 1)
helm install prometheus stable/prometheus-operator
kubectl get svc
kubectl edit svc prometheus-grafana
change to type NodePort
apiVersion: v1
kind: Service
metadata:
annotations:
meta.helm.sh/release-name: prometheus
meta.helm.sh/release-namespace: default
labels:
app.kubernetes.io/instance: prometheus
app.kubernetes.io/managed-by: Helm
app.kubernetes.io/name: grafana
app.kubernetes.io/version: 7.0.3
helm.sh/chart: grafana-5.3.0
name: prometheus-grafana
namespace: default
resourceVersion: "89873265"
selfLink: /api/v1/namespaces/default/services/prometheus-grafana
spec:
clusterIP: IPADRESSS
externalTrafficPolicy: Cluster
ports:
- name: service
nodePort: 30806
port: 80
protocol: TCP
targetPort: 3000
selector:
app.kubernetes.io/instance: prometheus
app.kubernetes.io/name: grafana
sessionAffinity: None
type: NodePort
status:
loadBalancer: {}
Deploy the ingress by making sure to replace the service name of your Grafana
cd ..\grafana
kubectl apply -f ingress.yaml
Get the login user and password of Grafana
- For the password :
kubectl get secret --namespace default prometheus-grafana -o jsonpath="{.data.admin-password}" | base64 --decode
- For the login user:
kubectl get secret --namespace default prometheus-grafana -o jsonpath="{.data.admin-user}" | base64 --decode
Get the ip adress of your Grafana
kubectl get ingress grafana-ingress -ojson | jq '.status.loadBalancer.ingress[].ip'
helm repo add loki https://grafana.github.io/loki/charts
helm repo update
helm upgrade --install loki loki/loki-stack --set fluent-bit.enabled=true,promtail.enabled=false
In order to build a dashboard with data stored in Loki, we first need to add a new DataSource. In Grafana, go to Configuration/Add data source.
Select the source Loki, and configure the URL to interact with it.Remember, Grafana is hosted in the same namespace as Loki. So you can simply refer to the Loki service :
In Grafana, select Explore on the main menu Select the datasource Loki. In the drop-down menu, select the label product -> hipster-shop
Loki has a specific query language that allows you to filter, transform the data, and even plot a metric from your logs in a graph. Similar to Prometheus, you need to :
- filter using labels : {app="frontend",product="hipster-shop" ,stream="stdout"} We're here only looking at the logs from hipster-shop, app frontend, and on the logs pushed in stdout.
- transform using | for example :
{namespace="hipster-shop",stream="stdout"} | json | http_resp_took_ms >10
The first |
specifies to Grafana to use the JSON parser that will extract all the JSON properties as labels.
The second |
will filter the logs on the new labels created by the JSON parser.
In this example, we want to only get the logs where the attribute http.resp.took.ms is above 10ms ( the json parser is replace . by _)
We can then extract on the field to plot it using all the various functions available in Grafana
If you want to plot the response time over time, you could use the function :
rate({namespace="hipster-shop" } |="stdout" !="error" |= "debug" |="http.resp.took_ms" [30s])
Now that we have used the default configuration with Loki, let's deploy the standard Fluentbit and explore the settings.
helm repo add fluent https://fluent.github.io/helm-charts
helm install fluent-bit fluent/fluent-bit
The configuration file is stored in a ConfigMap
kubectl get cm
[SERVICE]
Flush 1
Daemon Off
Log_Level info
Parsers_File parsers.conf
HTTP_Server On
HTTP_Listen 0.0.0.0
HTTP_Port 2020
[INPUT]
Name tail
Path /var/log/containers/*.log
Parser docker
Tag kube.*
Mem_Buf_Limit 5MB
Skip_Long_Lines On
[INPUT]
Name systemd
Tag host.*
Systemd_Filter _SYSTEMD_UNIT=kubelet.service
Read_From_Tail On*
Now that we have the default configuration to collect logs of our Pods Let's see how to filter and change the log stream
Let's add a Filter block to our current Fluent Bit pipeline
[FILTER]
Name kubernetes
Match kube.*
Merge_Log On
Merge_Log_Trim On
Labels Off
Annotations Off
K8S-Logging.Parser Off
K8S-Logging.Exclude Off
And an output plugin to see the transformed log in Stdout ( of our fluentbit pods)
[OUTPUT]
Name stdout
Match *
Format json
Json_date_key timestamp
Json_date_format iso8601
If you don't have any Dynatrace tenant, then let's start a trial Set up the Dynatrace K8s operator following the steps described in the documentation
In order to collect logs in Dynatrace, you'll also need to install the Active Gate.* Follow the documentation to install the Active Gate on a seperate server
Now we need to rename the log to content, and rename the Kubernetes information with the right fields.
[FILTER]
Name modify
Match *
Rename log content
Let's use the nest filter plugin to move the kubernetes tags
[FILTER]
Name nest
Match kube.*
Operation lift
Nested_under kubernetes
Add_prefix kubernetes_
Let's use modify plugin to rename and remove the non relevant tags
[FILTER]
Name modify
Match kube.*
Rename log content
Rename kubernetes_pod_name k8s.pod.name
Rename kubernetes_namespace_name k8s.namespace.name
Remove kubernetes_container_image
Remove kubernetes_docker_id
Remove kubernetes_container_name
Remove kubernetes_pod_id
Remove kubernetes_host
Remove time
Remove kubernetes_container_hash
Add k8s.cluster.name Onlineboutique
The Dynatrace ingest API is limiting the number of calls per minute. We need to throttle the streams :
[FILTER]
Name throttle
Match *
Rate 100
Window 100
Interval 1m
Last, we can now connect the Dynatrace API using the HTTP output plugin
[OUTPUT]
Name http
Match *
host YOURHOST
port 9999
URI /e/<DYNATRACE TENANT ID>/api/v2/logs/ingest
header Authorization Api-Token <DYNATRACE API TOKEN>
header Content-Type application/json
Format json
Json_date_key timestamp
Json_date_format iso8601
tls On
tls.verify Off
Let's open go to calyptia to visualize our log stream pipeline: