I'm running a Kubernetes Job from a Jenkins pipeline and want to stream its logs until completion.
I currently use this pattern in a Bash script:
job_status_cmd_complete="kubectl get job ${kube_job_name} --namespace=${kube_namespace} -o jsonpath={..status.conditions[?(@.type=='Complete')].status}"
job_status_cmd_failed="kubectl get job ${kube_job_name} --namespace=${kube_namespace} -o jsonpath={..status.conditions[?(@.type=='Failed')].status}"
function jobIsFinished {
  if [[ $($job_status_cmd_complete) == 'True' || $($job_status_cmd_failed) == 'True' ]]; then
    echo 'True'
  else
    echo 'False'
  fi
}
# First log stream — immediately after pod is ready
kubectl logs --tail=-1 --follow --selector=job-name=$kube_job_name --namespace=$kube_namespace -c $kube_job_name
# Loop to check for completion
while [[ $(jobIsFinished) != 'True' ]]; do
# Second log stream to catch more output in case the job is still running
kubectl logs --tail=100 --follow --selector=job-name=$kube_job_name --namespace=$kube_namespace -c $kube_job_name
done
Why I do this:
- The first kubectl logs --follow is to get logs as soon as the pod is ready.
- The second one in the loop is a safeguard: it keeps watching if the job is long-running or its pod restarts.
- The intent was to make sure I don't miss logs if the first stream disconnects or the pod is slow to start logging.
Issue:
This approach sometimes causes duplicate logs, especially in Jenkins output.
It's not clear whether --follow maintains log offset or restarts from the beginning in each call.
Is kubectl logs --follow stateless and prone to duplication if used multiple times?
Yes, kubectl logs is stateless and prone to repeating content. Each invocation opens a fresh stream with no memory of what a previous call already printed, so starting a second --follow while (or after) the first has run will replay overlapping output — which is exactly the duplication you see in Jenkins. For durable, de-duplicated log collection, see Logging Architecture in the Kubernetes documentation; configuring that is more of a system-administration question than a programming one.
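Within a plain script you can still avoid most duplication by streaming once from the start and, on reconnect, resuming from a timestamp instead of replaying everything. Below is a minimal sketch under those assumptions: it assumes the Job runs a single pod whose container logs you want, reuses the kube_job_name/kube_namespace variables from your question, and the helper names (job_is_finished, job_condition, stream_logs_until_done) are made up for illustration.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Pure helper: decide whether the Job is done from its two condition
# statuses ("True", "False", or empty). Needs no cluster access.
job_is_finished() {
  local complete="$1" failed="$2"
  if [[ "$complete" == 'True' || "$failed" == 'True' ]]; then
    echo 'True'
  else
    echo 'False'
  fi
}

# Read one Job condition; quoting the jsonpath keeps the shell away
# from the ?() filter expression.
job_condition() {
  kubectl get job "$kube_job_name" --namespace="$kube_namespace" \
    -o jsonpath="{.status.conditions[?(@.type=='$1')].status}"
}

stream_logs_until_done() {
  # Wait for the pod so the single full stream starts from the top.
  kubectl wait pod --namespace="$kube_namespace" \
    --selector="job-name=$kube_job_name" \
    --for=condition=Ready --timeout=300s

  local since=''
  while [[ $(job_is_finished "$(job_condition Complete)" "$(job_condition Failed)") != 'True' ]]; do
    if [[ -z "$since" ]]; then
      # First attempt: everything from the beginning.
      kubectl logs --follow --tail=-1 \
        --selector="job-name=$kube_job_name" --namespace="$kube_namespace" || true
    else
      # A restarted stream resumes near where the last one dropped.
      # --since-time has one-second granularity, so a line or two of
      # overlap is still possible, but not a full replay.
      kubectl logs --follow --since-time="$since" \
        --selector="job-name=$kube_job_name" --namespace="$kube_namespace" || true
    fi
    since=$(date -u +%Y-%m-%dT%H:%M:%SZ)
    sleep 2
  done
}

# Only talk to the cluster when the caller has set the variables.
if [[ -n "${kube_job_name:-}" && -n "${kube_namespace:-}" ]]; then
  stream_logs_until_done
fi
```

The key design change versus your script is that there is only ever one log stream at a time, and the restart path narrows its window with --since-time instead of re-reading from the beginning, since kubectl itself keeps no offset between calls.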