Can confirm cloud GPU is way overpriced if you're doing 24/7 rendering. We run a...

jack2222 · on Jan 2, 2021

I've had renders cost $50,000! Or CTO was less than amused

cosmodisk · on Jan 3, 2021

I hope you didn't have to bin them, as Vogue did with one of their photoshoots costing a bank:)

dtgriscom · on Jan 3, 2021

Sounds interesting: reference?

cosmodisk · on Jan 5, 2021

In the documentary: The September Issue

rbanffy · on Jan 3, 2021

> Running 24/7 for three months, it's cheaper to buy consumer grade hardware

If you have a steady load cloud makes little sense. It only makes sense if you have a tight deadline (as is not that uncommon with video and VFX) and can't fit it within your deployed capacity.

malthejorgensen · on Jan 3, 2021

How do you manage the bare metal cluster? (E.g. apt/yum updates but also networking and such)

erosenbe0 · on Jan 3, 2021

I'm a bit out of date but if we are talking about rendering (not data retrieval workloads) I believe the best way is fundamentally the same as it was 25 years ago: network boot, mostly network storage, and applying local config overlays based on MAC address or equivalents. Exactly what push or pull techniques are in vogue I am not sure but definitely no running package managers on each node. You want as little as possible locally -- just a scratchpad disk that can be rebuilt automatically in minutes.

tinco · on Jan 3, 2021

When it was 3 nodes, and then 6 nodes, the answer was very unprofessionally. I didn't get the budget for a system administrator, and I spent all my budget on developers that could build our application and automate our preprocessing, overlooked system administration skills. So besides the DoE, managing 3 small teams and being the lead developer, I also am the system administrator.

So no fancy answer, our 3D experts got TeamViewer access to the nodes running Windows Pro. Sometimes our renders fail on patch Tuesday because I forgot to reapply the no-reboot hack.

We're professionalizing now at 12 nodes, we got to the point where the 3D experts don't need to TeamViewer in, so we're swapping them to headless Linux. No idea on the update management yet, but they're clean nodes running Ubuntu server.

Narann · on Jan 3, 2021

Network solutions highly depends on the physical infrastructure, but for setup maintenance, you can often see SaltStack.