The cynic in me thinks they like having the extra revenue.
In fact, the incentives are diametrically opposed: almost every one of them makes more money when our builds take longer to run, regardless of the reason. So they are financially disincentivized to build anything that could make builds faster, or even to make it easier to limit their duration, as is the case here. When improvements do happen, it's a rare triumph of some combination of genuinely good intentions, customer demand, and competitive pressure over the demand for financial returns that every company eventually has to come to terms with, and it's not sustainable over the long term.
The ones that let us host our own runners at least offer an escape hatch: we can opt out of the diametrically opposed incentive structure and back into a still-not-aligned but neutral one. But then we give up much of the benefit of CI as a SaaS and have to spend engineering hours building and maintaining our own build infrastructure at significant cost.
Let's not forget that traditional CI is itself already a commodity: providers sell us dumb CI minutes, we spend our own engineering hours building deployment and testing solutions on top of them, and eventually we sink entire full-time engineering teams' worth of hours into fighting the natural tendency of these systems to get slower as we add more code and people.
I believe the solution is deployment & testing platforms tailored to specific technologies, meticulously engineered to be so ridiculously fast that they can reasonably be offered as an all-you-can-eat plan for a fixed monthly price per seat, instead of the industry-standard usage-based pricing of traditional CI providers. This aligns incentives much better: slow builds hurt the provider's bottom line as much as they hurt customers' engineering productivity. On the flip side, it financially incentivizes constant investment in making the system even faster, since faster builds mean the provider can serve more customers on the same hardware and pocket the difference as profit.
Shameless plug: I've been building one of these platforms at https://reflame.app. Reflame can deploy client-rendered React web apps in milliseconds, fast enough to make deploying to the internet feel like local dev.
Github should send a bunch of money to the act developer - I know I wouldn't have used Github actions at all without act existing, I'm sure other people must be in the same situation. (Though I'm not paying Github either, so perhaps I'm not a target customer...)
I've found it especially useful for fixing complex workflows or working with custom actions. It's not strictly needed, but it does speed up your workflow once you figure out the kinks.
* Scheduled actions basically never run anywhere close to on schedule. If you schedule something to run every 13 minutes, it may just run 1-3 times an hour, with random 30-minute to 1-hour waits between executions.
* Triggering a workflow as a result of an action from another workflow doesn't work if you're using the GITHUB_TOKEN as part of the action. Github does this to prevent accidental recursion, but it forces you to either use insecure PATs or rearchitect how to handle chained events: https://docs.github.com/en/actions/using-workflows/triggerin...
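One supported way to chain workflows without resorting to a PAT is the workflow_run trigger, which fires when another workflow completes. A sketch (the "Build" workflow name is illustrative):

    # downstream.yml: runs after the "Build" workflow finishes,
    # without needing a PAT to re-trigger events
    on:
      workflow_run:
        workflows: ["Build"]
        types: [completed]

    jobs:
      deploy:
        runs-on: ubuntu-latest
        # only proceed if the upstream workflow succeeded
        if: ${{ github.event.workflow_run.conclusion == 'success' }}
        steps:
          - run: echo "upstream build succeeded"

It doesn't cover every chaining scenario, but it avoids the GITHUB_TOKEN recursion restriction for the common "run B after A" case.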
I miss the days of setting the clock/date to avoid time bombs in software builds. "Back in my day, things were so much easier!" Now, I would not be surprised if the teams working on these kinds of lockouts are larger than the teams building the product.
I welcome their anti-recursion measures because I fought in the recursive clone wars and no one should have to support any systems that allow that.
I'm not talking about using this on a free tier or something. Github actions are billed monthly. This goes way beyond just not having a tight SLA. Precision isn't even the ask here. It's one thing if a job scheduled to run every 10 minutes occasionally takes 12 or 13 in between runs. It's a completely different matter if it takes an hour.
Having some safe-guards against unbounded recursion is one thing, but the escape hatch for it right now is to use less secure credentials. That's just madness.
As much intelligence as possible ought to be pushed down to the script level, where it can be tested and debugged, so that you're left with little more than a linear sequence of 4-5 commands in your YAML.
The debugging tooling on GitHub Actions is, frankly, abysmal. You need a third-party ngrok action just to get an SSH session.
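For interactive debugging, one workaround I've seen is a third-party tmate action that pauses the job and prints SSH connection details in the log. A sketch (the action is a community one, not something GitHub provides):

    jobs:
      debug:
        runs-on: ubuntu-latest
        steps:
          - uses: actions/checkout@v3
          # pauses the job and prints an SSH address you can
          # connect to for poking around on the runner
          - uses: mxschmitt/action-tmate@v3

Remember to remove (or gate) the step before merging, since it holds the runner open until you disconnect.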
Also, I like that you build the hypothetical merge of branch + main. But that commit SHA is gone after that successful build. Give me a way to track this. I need to store artifacts related to this build, as I don’t want to build those again!
https://docs.github.com/en/actions/using-workflows/storing-w...
They do seem to be capable of saving most things people call artifacts. If you're looking for something more along the lines of caching parts of the build for future builds, you can adjust that pretty easily by changing what the cache key is based on.
example:
key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}
which lets you cache based on the hash of a specific dependency lock file instead of the commit SHA.

https://docs.github.com/en/actions/using-workflows/caching-d...
https://github.com/actions/cache
The one note here is that clearing the cache / cache management isn't straightforward currently (although they are improving it); there are a few acceptable workarounds, though.
Not sure if you were aware of these already.
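For context, a typical actions/cache step using that key, plus restore-keys as a fallback. A sketch for a Rust project (paths are illustrative):

    - uses: actions/cache@v3
      with:
        path: |
          ~/.cargo/registry
          target
        key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}
        # fall back to the most recent cache for this OS when
        # the lock file hash has no exact match
        restore-keys: |
          ${{ runner.os }}-cargo-

The restore-keys prefix match is what keeps a lock file change from throwing away the entire cache.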
I believe YAML supports non-string keys, so your key would be parsed as the corresponding boolean value (true). If the pipeline then goes through JSON, where only string keys are supported, the serializer could simply stringify the key rather than raise an error, leading to "True".
And that’s one of the billion reasons why barewords are bad.
I think this has been fixed in YAML 1.2, but there are a lot of YAML 1.1 libraries out there, and they can't just switch since that could break user code.
That's when I found out the YAML spec explicitly says it's human-readable, not human-writeable. Our mistake was assuming YAML was a configuration format, when actually it's a data serialization format (again, spec explicitly says this) that is easy to read.
Now I only write YAML files with a YAML generator, because just running a hand-edited file through a parser may fall victim to a parser quirk.
More likely they hacked their YAML parser to treat "on" as a string.
At least that's what Travis CI folks did:
In fact, YAML does that terrible substitution for both keys and values.
In New projects I tend to use scripts to perform any required task for the ci and have github actions only run the script. Way easier to reason about.
Gitlab CI definitely handles this better with its "script" concept.
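A minimal workflow in that style, where the YAML is only a shim around a version-controlled script, might look like this (the script path is illustrative):

    name: ci
    on: [push]
    jobs:
      test:
        runs-on: ubuntu-latest
        steps:
          - uses: actions/checkout@v3
          # all the real logic lives in the script, which can be
          # run and debugged locally without any CI involvement
          - run: ./scripts/ci.sh

The payoff is that a failing build can usually be reproduced with one local command instead of trial-and-error pushes.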
1. for Github to natively allow CI management for several repos in a centralized way, so repo setup can just be "select this CI config" instead of "copy this YAML file and change the project name in some places"
2. to mandate certain CI steps at the organization level (such as running `black`) so it isn't opt-in
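Reusable workflows (workflow_call) get partway to #1: each repo keeps only a thin caller file that delegates to a definition maintained centrally. A sketch (the org/repo and file names are illustrative):

    # .github/workflows/ci.yml in each repo: a thin shim that
    # delegates to a workflow maintained in a central repo
    on: [push]
    jobs:
      ci:
        uses: my-org/ci-workflows/.github/workflows/standard-ci.yml@main

It's still opt-in per repo, though, so it doesn't address #2's mandatory-steps requirement.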
If they allowed config to come from an internal setting not visible in the repo, I'm sure repos I collaborate with would start using that feature, and I would not be able to find their Actions configs.
(I work mostly on open source, which may lead to different patterns of access and such).
I haven't tried it yet though.
However, that then exposed me to the up thread bug about files. So now I also have to delete the file before creating it. Sigh.
If you're only free to run those workflows when they land in the default branch, does that also mean that the workflow that runs is the one from the default branch and if you change the workflow in a PR, it will only run the new workflow on merge?
I know there's something in here to permit non-owned commits (from an external contributor) to be tested against a trusted workflow from the main branch, but I don't think it has anything to do with workflow_dispatch. I would expect that if you're able to run workflows and target any branch, then if the workflow you run is the one contained in that branch, you'd be able to select any workflow that is named and defined in the branch's configuration.
I'm not saying that's how it works, I'm saying that's how I'd imagine it to work. If someone knows "the rule" that we can use to disambiguate this and understand it from the precepts that went into the design, maybe speak up here? I don't get it.
The premise of your question is wrong: you can trigger workflow_dispatch workflows in any branch via the UI if a workflow by that name also exists in the default branch, and only via the API if no workflow by that name exists in the default branch.
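For reference, a workflow_dispatch trigger, with the REST endpoint for the API path noted in a comment (sketch; the input is illustrative):

    # trigger via the UI (if this file exists on the default branch),
    # or via the API for any ref the file exists on:
    #   POST /repos/{owner}/{repo}/actions/workflows/{workflow_file}/dispatches
    #   with a JSON body like {"ref": "my-branch"}
    on:
      workflow_dispatch:
        inputs:
          environment:
            description: 'Target environment'
            required: false
            default: 'staging'

    jobs:
      run:
        runs-on: ubuntu-latest
        steps:
          - run: echo "ref=${{ github.ref }} env=${{ github.event.inputs.environment }}"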
- [1] https://github.com/actions-runner-controller/actions-runner-...
My employer used some code from philips-labs to support ephemeral runners. Works great after a few customizations.
I wrote a shell script and a very small Go program to support ephemeral MacOS runners on-premise.
These things are so fun to work on.
I'm 100% sure they don't use this internally, as these are glaring issues that impact anyone using the self-hosted runner. They also recommend running the container as root [1] instead of designing something more secure and sane.
1: https://github.com/actions/runner/issues/434#issuecomment-61...
The result is that root (or another user) inside the container can write root-owned files, because they have the same UID as root on the container host.
My employer runs an orchestrator and destroys each runner VM after a single job, so this only bites the user who causes it, not anyone else.
The checks associated with the workflow don’t run and stay in a pending state, preventing the PR from being merged.
The only workaround I’m aware of is to use an action such as paths-filter [3] instead at the job level.
A further, related frustration/limitation: you can _only_ set the "paths" property [2] at the workflow level (i.e. not per-job), so those rules apply to all jobs in the workflow. Given that you can only build a DAG of jobs (i.e. "needs") within a single workflow, it makes it quite difficult to do anything non-trivial in a monorepo.
[1]: https://docs.github.com/en/repositories/configuring-branches...
[2]: https://docs.github.com/en/actions/using-workflows/workflow-...
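A job-level filter along the lines of the paths-filter action mentioned above might look like this (sketch; the filter name and paths are illustrative):

    jobs:
      changes:
        runs-on: ubuntu-latest
        outputs:
          docs: ${{ steps.filter.outputs.docs }}
        steps:
          - uses: actions/checkout@v3
          - uses: dorny/paths-filter@v2
            id: filter
            with:
              filters: |
                docs:
                  - 'docs/**'
      build-docs:
        needs: changes
        # each filter is exposed as a 'true'/'false' string output
        if: needs.changes.outputs.docs == 'true'
        runs-on: ubuntu-latest
        steps:
          - run: echo "docs changed"

Unlike the workflow-level "paths" key, the skipped jobs still report a conclusion, so required checks don't hang in a pending state.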
Of course, running various commands on virtual machines and shells is inherently messy, but GHA could have done a lot to hide that. Instead I feel like I'm forced to mix YAML, bash, PowerShell, and various higher-level scripting languages (that came with the projects) into an unholy concoction that is hard to get right (return codes, passing values, and escaping come directly to mind) and even harder to debug, because it all runs somewhere else (act helps, a little, but it doesn't properly replicate the GHA environment).
I kind of wish I could write all my workflows cross-platform from the start in some well-known but full-fledged scripting language. (Which of course I could, and just use GHA to call that script.) What options are out there to make this whole thing less brittle?
"ubuntu-latest" isn't necessarily the latest Ubuntu; it's the latest version that has been fixed up to the point of having no known workflow-breaking issues, I believe.
https://github.com/actions/runner-images/
I rely on that repo to build my own images, and it is a frequent cause of failed builds. I'm going to convert almost all of it to installation via Homebrew instead, I think. Works well for macOS, anyway.
> While this behavior can be changed by passing ignoreReturnCode as the third argument ExecOptions, the default behavior is very surprising.
This is the same behavior as Node's child_process exec when wrapped by util.promisify [1]. If something returns a promise (async func), it should be expected that it has the possibility of being rejected.
[1] https://nodejs.org/api/child_process.html#child_processexecc...
https://github.com/rhysd/actionlint
> oops.yaml:20:24: property "jobone" is not defined in object type {jobtwo: {outputs: {}; result: string}} [expression]
As it turns out, images are pulled at the start of the run, which means your docker login will have no effect if you're currently bumping into these pull limits. This is made worse by the fact that the images themselves are controlled in the remote actions you're using, not something in your own codebase.
So you're left with either: forking the action and controlling it yourself, or hoping the maintainer will push to the Github registry.
For example, I'd like to build an action that triggers a documentation update based on the path and filename that is changed.
on:
push:
branches:
- main
paths:
- */README.md
But there does not appear to be a way to pass a list of changed paths to the job.

on:
push:
branches:
- main
paths:
- docs/**
- README.md
I use something similar for triggering different app workflows in a monorepo.

EDIT: Or in multiple directories, but grouped into multiple documentation directories.
on:
push:
branches:
- main
paths:
- package1/docs/**
- package2/docs/**
- package3/docs/**
- README.md