The AMD Radeon Graphics Driver Makes Up Roughly 10.5% of the Linux Kernel (phoronix.com)
707 points by akvadrako on Oct 11, 2020 | 462 comments



My two cents as a kernel developer: the driver is pretty abominable compared to the code quality of most of the rest of the kernel.

However, having a GPU driver that is not just open source but in the upstream Linux kernel is a gigantic deal. Kernel development takes a long time; we have millions of lines in the amdgpu driver, and if every one of those had gone through the usual lengthy review process, the driver would never have made it into the tree.

So it's a necessary evil. I do wish they would clean it up, though. I once sent a fix to amdgpu that was the same change applied to 3 different files that were largely duplicates of each other. That kind of thing wouldn't fly anywhere else in the kernel.


I would also mention that GPUs are a GIANT abstraction, and since they are rev'd faster than arguably any other hardware in a system, there are abstractions layered on top of that for the families and models of GPUs too.

Another way of looking at it is -- I started playing with openwrt for a relatively small router, with 5 ports plus wifi.

I was amazed at not only the amount of openwrt code required to support the different router families and the different router models, but at the sheer amount of stuff turned on by default in the kernel just in case I might need to load a module for some obscure feature or package. I assume the same goes for a gpu driver both at the source level and in the kernel.


Yeah, AMD/NV typically do 2-3 chips per year. That complexity adds up pretty quickly when backwards/forwards compatibility is fairly strict and the inputs/abstractions are not particularly well defined or behaved.


I'm wondering at which point it would make sense to split the driver into multiple device family drivers instead of lumping it all together into a mess of unmaintainable abstractions.


At the point where people are making jokes about the Linux kernel making up X% of the GPU driver.


The opposite. The lack of abstractions - sheer industrial copy-pasta code - is the problem.


I’m not saying it’s true here, but duplication is always better than the wrong abstraction. Seems to me that if each graphics card is different enough, there’s probably lots to duplicate.


In software development absolutes are always dangerous.


Apart from absolutes about absolutes.


Nonsense. This duplication is not maintainable and needs massive amounts of memory. Proper abstraction adds a few if/else branches in the data and is about 20x smaller. You can even read and understand that, e.g. what changed with this HW upgrade. There's no chance of that with duplicated blobs of structs and enums.
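To make the data-driven argument concrete, here's a minimal hypothetical sketch in C (every name and field below is invented, not taken from amdgpu): the per-generation differences live in one table, and the handful of genuinely divergent steps become a conditional instead of a forked copy of a file.

    /* Hypothetical sketch: per-generation differences captured as data
     * rather than as duplicated source files. All names are invented. */
    #include <stdbool.h>
    #include <stdint.h>

    struct gpu_gen_params {
        const char *name;
        uint32_t    max_shader_engines;
        uint32_t    vram_bus_width;
        bool        has_doorbell_v2;   /* one flag instead of a forked file */
    };

    static const struct gpu_gen_params gen_table[] = {
        { "gen6", 2, 256, false },
        { "gen7", 4, 384, false },
        { "gen8", 4, 512, true  },
    };

    /* One shared init path: the few real differences become a table
     * lookup plus an if/else, instead of three near-identical copies. */
    static void init_gpu(const struct gpu_gen_params *p)
    {
        /* ... common setup for every generation ... */
        if (p->has_doorbell_v2) {
            /* the one genuinely different step for newer parts */
        }
    }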


> I’m not saying it’s true

Yes, I did caveat my suggestion. Why not submit a fix if it's so simple ;-)


If the Radeon driver is in the kernel now, then maybe at some point someone other than AMD will pick it up and start cleaning up excessive copy paste code.

Assuming they play nice with the community, it could be a huge benefit to AMD in the long run.

Still, 2 million lines is a massive amount of code to start working on.


Unlikely, unless that entity has all the supported hardware at hand and ready for automated tests. Refactoring without a thorough test harness is ref*toring.


Isn't that what Google are hoping to do with Fuchsia, to make a next generation of Android that's not dependent on device drivers in the kernel?


I can't see how they'd be able to achieve that, unless you mean simply that the device drivers would run in userland.


Reinventing QNX will always be cutting edge


Are there still workarounds for specific games/programs inside the driver?


You can see all the workarounds used in Mesa here: https://gitlab.freedesktop.org/mesa/mesa/-/blob/master/src/u...

Proprietary drivers (especially Nvidia's) most likely have lots of similar game-specific workarounds and optimizations (even going as far as overriding shaders in games with better ones they wrote). [1]

1: https://www.gamedev.net/forums/topic/666419-what-are-your-op...


Yes. In fact, that's essentially what the drivers are.


Leaving aside arguments about "what the drivers are", the kernel driver being discussed here generally doesn't have or need that kind of thing. The user-space drivers which talk to the kernel drivers are under the Mesa umbrella as part of Gallium for OpenGL and Direct3D support (e.g. https://github.com/mesa3d/mesa/tree/master/src/gallium/drive...) or as a standalone driver for Vulkan support (https://github.com/mesa3d/mesa/tree/master/src/amd/vulkan). That said, I haven't seen many app-specific hacks in the open source drivers, even in the user-space code.

If anyone wants to learn more about lower-level aspects of GPUs, the Vulkan driver code I linked is one of the best places to start. It directly implements the Vulkan API on one end and talks to the kernel drivers on the other end, so it's relatively easy to follow if you're a systems programmer with an API-level understanding of graphics. Just pick a Vulkan function of your choice and start tracing through the code, e.g. vkCmdDraw: https://github.com/mesa3d/mesa/blob/master/src/amd/vulkan/ra.... The Vulkan driver calls into some of the low-level radeonsi code I linked from the Gallium tree but it isn't a Gallium-based driver, so you don't have to deal with those extra layers of abstraction.


> That said, I haven't seen many app-specific hacks in the open source drivers, even in the user-space code.

They are enabled via driconf [0]. Not nearly as many as what I imagine you'd find in the proprietary Windows drivers though.

[0] https://github.com/mesa3d/mesa/blob/master/src/util/00-mesa-...



I understand how big a deal this is and want to buy an AMD card for my next PC, just to support them, but is the driver actually good? Ie, is support for AMD cards on par with Windows?

The Nvidia driver is crappy, doesn't support Optimus, etc, but at least I haven't had any problems with it for as long as I've used it.


I bought an AMD GPU specifically for use with my Linux workstation and haven't regretted it. Perhaps I simply had bad luck with specific nVidia cards, but the AMD driver is stable in a way the nVidia driver simply never was, especially w/ respect to GPU accelerated desktop environments and screen capture utilities. The only change I made was to switch Arch over to the LTS kernel, as the upstream kernel in Arch isn't quite as battle hardened, and did occasionally require a rollback. That's not something that's likely to affect any other distro though, it's a side effect of Arch's bleeding edge nature.

Anecdotal data and all that. I'm on a Radeon VII, pretty darn solid, will probably continue to choose AMD cards in the future. Wish the Windows driver were a bit more stable, and it's... frankly weird to be saying that in comparison to the Linux driver for the same card.


I have had a similar experience with my laptop that uses a Ryzen 5 Pro 2500U: it would crash once every couple of days under Windows, but no such issue crops up using mainline drivers on Debian.


Yeah, from what I understand the difference in driver stability between Nvidia and AMD on Linux is exactly the reverse of their relationship on Windows.


>Wish the Windows driver were a bit more stable

I have an AMD 5700XT in my Windows games machine, and the driver is an absolute travesty. And looking over the installed files is a horror show. Qt5WebEngineCore.dll and avcodec-58.dll, because a browser engine and ffmpeg are essential in a device driver. And why does FacebookClient.exe exist? Fuck knows.


It's even worse with NVIDIA - they ship an entire custom NodeJS with their drivers.

Also don't forget that these are user-space apps that are simply bundled with the driver but not necessarily part of it. Qt5WebEngineCore.dll is most likely used by the UI portion of the driver (settings dialogs, radeon software etc.), same with the ffmpeg dll and the facebook client.

NVIDIA does the exact same btw. - see [1]

[1] https://www.ghacks.net/2020/03/13/nv-updater-nvidia-driver-u...


I think that crap is only included if you install "Geforce Experience". Nvidia parted off their garbage into an optional component, while AMD forces you to install it.

For anyone using Nvidia on Windows, here's a useful tool to carve out most of the trash from the driver prior to installing.

https://www.techpowerup.com/download/techpowerup-nvcleanstal...


Still nothing compared to typical RGB control software, which on Windows only runs while you're actively logged in (it stops when locking the screen/desktop) instead of being a tight/light service that uses the "last known" config and gets updates from the desktop GUI. Let alone the painfully missing technical docs or support for Linux.


Does AMD make you sign in to open their locally-installed driver utility? Nvidia seems to have thought it was a good idea and went ahead and did that a year or so ago...


No, and neither does Nvidia. "Geforce Experience" is not the driver. It's just bloatware nobody needs.


I have the same card and have issues with locking up and crashing in both Windows and Linux. It looks like the kernel in Ubuntu 20.10 might have a fix for some of the issues.


I remember back when AMD had both closed and open drivers and, having trouble with the proprietary drivers, I switched to the OSS version. It was NIGHT AND DAY. Games that would crash and had weird oddities now ran smooth with higher frame rates. A lot of weird video issues, especially those that come from running more than one X server or Xnest, all went away.

I would only use AMD cards in my Linux boxes. The nvidia drivers/cards pale in comparison.


At least on par. That said, the AMD driver on Windows is notoriously crap compared to Nvidia.

I'm hoping AMD's next gen turns out to be competitive with the RTX3000 series for my next GPU for the same reason.


That is really interesting, I haven't had an AMD card for >10 years now and would really like to be rid of Nvidia due to the closed source drivers. How is suspend/resume? Thinking about canceling my pre-order queue with EVGA and getting a 6xxx card.


For system integration stuff like suspend/resume, display hotplug and resolution changes, etc, the open-source radeon driver is good, and probably the best option on linux. The 3D accel is not bad (but not as good as nvidia).

However, don't expect a new Radeon GPU to be well supported on day of release, expect 1 kernel release cycle until it basically works, and one more until it has most of the bugs ironed out, and then wait until your favorite distro gets that kernel. So you're looking at 3 to 9 months depending on what distro you use.

I'm personally going to be looking for people selling their RX 5700, to replace my RX 480 ...


Agree mostly, but the best option under Linux seems to be Intel graphics (or at least it was until a few years ago) - arguably not beefy enough for some things, but regarding supported features, stability and power consumption it's the best-supported mainstream GPU in the Linux kernel.

Intel simply has no closed source driver for Linux. New hardware is often supported/merged before it is even sold. AMD is trying the same, but not there yet.


The Intel i915 driver STILL crashes my system regularly even on the latest kernels... I have a skylake i5-2600k and the iGPU is absolute dogshit. Not sure if it's a hardware or driver issue but it still hasn't been sorted out after all these years.

Typically the entire system will freeze (speakers will continue to play whatever was in the short audio buffer - pretty awful) for 10-15s, then the driver will detect the hang and reboot the iGPU. Happens much more frequently (every ~15m) when using more graphically intense programs. I can't use blender because sometimes when it hangs it won't reset and requires a full reboot.

There are dozens of issues about it and related problems in Intel's drm fork of the kernel [0]. I (finally) posted a bug report about it months ago since it seemed to have gotten worse after 5.4 but never heard back from them.

All this to say - be wary of Intel graphics on linux.

[0] https://gitlab.freedesktop.org/drm/intel/-/issues


Ever since kernel 5.7 was released my i7-5500 will not boot. (Well it will boot with “nomodeset” option but then X doesn’t work so not very useful.) It’s still not fixed in 5.9.


Wouldn't even say that, I've experienced regressions/bugs on intel drivers for laptops a few times.

In general, it's kind of a crapshoot no matter which way you go, and expect pain if the gpu chipset is less than a year old.


> The 3D accel is not bad (but not as good as nvidia).

How is that? I think Mesa provides state of the art OpenGL and Vulkan support, especially with work on ACO. Nvidia doesn't have any edge in that anymore. They did a few years ago still, but not today.


Last time I checked (which was about a couple of months ago), Mesa had very primitive support for display lists (most of the time you get a simple command playback, though if you only submit vertex commands it gets converted to a VBO - and I think that was added relatively recently), whereas Nvidia's driver performs optimizations in background threads to convert lists into the best GPU format, splits them into the minimum number of calls, and performs culling on the full list before rendering. AMD's Windows drivers also do some of that stuff (though not all).
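For anyone who hasn't touched this part of OpenGL: a display list records a batch of immediate-mode commands once so the driver can replay (and, ideally, optimize) them on every later call, which is exactly where the driver-quality differences above show up. A minimal sketch using the classic GL 1.x API (assumes a current GL context; not tied to any particular driver):

    #include <GL/gl.h>

    /* Record a small batch of immediate-mode geometry into a display list.
     * A clever driver can turn this into an internal VBO, reorder it, or
     * cull it when the list is replayed later. */
    GLuint build_triangle_list(void)
    {
        GLuint list = glGenLists(1);
        glNewList(list, GL_COMPILE);
        glBegin(GL_TRIANGLES);
        glVertex3f(-1.0f, -1.0f, 0.0f);
        glVertex3f( 1.0f, -1.0f, 0.0f);
        glVertex3f( 0.0f,  1.0f, 0.0f);
        glEnd();
        glEndList();
        return list;
    }

    /* Replay it each frame; the optimization opportunity lives here. */
    void draw_frame(GLuint list)
    {
        glCallList(list);
    }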

Mesa does implement a lot of stuff, but they do not take much advantage of what the higher-level parts of the API allow them to do to optimize rendering. From what I remember, until AMD pushed some devs onto it, they didn't care about supporting the entire API at all.

Vulkan support is most likely good though.

(EDIT: yes, "display lists are deprecated", but this is irrelevant - the API is there, available, and works great on Nvidia and still very well on the AMD Windows driver, and a lot of applications use it. Khronos splitting the API into core/compatibility profiles was a mistake that made everything more complicated than necessary; if they wanted a clean API, what they should have done was make something new, like they eventually did with Vulkan, and avoid messing up OpenGL.)


> Mesa does implement a lot of stuff, but they do not take much advantage of what the higher-level parts of the API allow them to do to optimize rendering.

There is always more that could be optimized, especially when it comes to niche use cases, but generally Mesa/radeonsi do a decent job of making things fast.

> yes, "display lists are deprecated", but this is irrelevant - the API is there, available, and works great on Nvidia and still very well on the AMD Windows driver, and a lot of applications use it

By "lot of applications" you mean some workstation applications that refuse to upgrade their code. You can still use AMD's closed source driver on Linux if you need optimizations for those. If you don't (and most people won't) then Mesa works extremely well.

> Khronos splitting the API to core/compatibility was a mistake that made everything more complicated than necessary when what they should have done if they wanted a clean API would be to make something new like they eventually did with Vulkan and avoid messing up OpenGL

You could argue for drivers not providing newer features in the compatibility profile (and Mesa did that until recently) but as long as there are customers demanding support for newer features while refusing to move off the older APIs, this is what you will get. I don't think having OpenGL Core and OpenGL Compat sharing some of the API hurt anything here.


> There is always more that could be optimized, especially when it comes to niche use cases, but generally Mesa/radeonsi do a decent job of making things fast.

Sure, I didn't dispute that; what I wrote was that Nvidia's drivers are faster in some cases, based on code I've actually seen. And they used to be slower in that case too until not too long ago, so it isn't like they aren't improving. But Nvidia's implementation is still faster.

> By "lot of applications" you mean some workstation applications that refuse to upgrade their code. You can still use AMD's closed source driver on Linux if you need optimizations for those. If you don't (and most people won't) then Mesa works extremely well.

I mean games, applications and tools, not workstation applications. Not every application uses the latest and - rarely - greatest version of everything out there, nor are all applications always updated - or even under development (especially games). Those that are may have other priorities too.

But why an application uses some API is irrelevant; the important part is that the API is being used and one implementation is faster than another, showing that the other implementation has room for improvement.

> You could argue for drivers not providing newer features in the compatibility profile (and Mesa did that until recently) but as long as there are customers demanding support for newer features while refusing to move off the older APIs, this is what you will get. I don't think having OpenGL Core and OpenGL Compat sharing some of the API hurt anything here.

My point was that the split itself was a mistake (it isn't like splitting OpenGL into Core and Compatibility was a mandate from heaven - or hell - it was something Khronos came up with), and the harm was that it made things complicated for a lot of people (e.g. not everyone cares about having the best performance out there - some applications are tools that won't even come close to using 1% of a GPU's power, but they'd still prefer to rely only on open APIs instead of some proprietary one or some library that may be abandoned next year - code written for OpenGL 1.x 25 years ago can still work fine on modern PCs, after all) and split the OpenGL community into two "camps".

This created issues like libraries and tools only supporting one version or the other, tons of bugs and wasted time "integrating" with Core (or supporting both Compatibility and Core), and the invalidation of a ton of existing knowledge and books (OpenGL being backwards compatible down to 1.0 is very helpful, since you can always start at the beginning with something proven and work your way towards more modern functionality on an as-needed basis). And in the end all of that was a huge waste of time, since everyone outside Apple decided that Compatibility is necessary - and Apple decided that splitting OpenGL in two halves wasn't enough, so they made everyone's life even harder and came up with a proprietary API all on their own.


ACO developers will work on OpenGL at some point too. OpenGL in general isn't something I worry about, as long as it performs sufficiently well. All modern things should be using Vulkan anyway, especially if something requires a focus on performance.

And deprecated features? I think there are better things to focus on first, optimization-wise.


Well, the original comparison was with Nvidia's driver and Nvidia has a much more optimized driver.

Also it is much more practical (and realistic) to have a few devs optimize a handful of API implementations than to expect the thousands of devs who work on thousands of applications to do that (which is also why OpenGL etc. isn't going anywhere).


> Well, the original comparison was with Nvidia's driver and Nvidia has a much more optimized driver.

I wouldn't say that. In all common cases they don't. And as above, deprecated features are the last thing I'd start comparing on. If you use something deprecated, performance shouldn't be your worry; rewriting your code should be.


That sounds just like sour grapes :-P "Mesa is as fast as Nvidia" "But they are slower in these cases" "That doesn't count".


At least on the hardware that I've had, it's basically rock-solid in practice. I use it with high-refresh-rate monitors, I've tried FreeSync and that works, it works with all my displays, and recently the older of my GPUs (Radeon Pro WX 7100) finally got audio output over DisplayPort, as did the newer one (Radeon VII), though I never really had any use for that feature.

As for acceleration, particularly with RadeonSI and RADV: the RADV developers (independents, Valve, and some smaller companies I wish I remembered the names of) have been making massive improvements on the shader compiler side. RADV's own shader compiler (ACO) is noticeably better than the first-party AMD LLVM stack, and RADV is substantially faster than any of the first-party AMD Vulkan drivers for both graphics and compute workloads. I hope ACO in RadeonSI becomes a thing; I think it will be a major improvement.

Message to anyone listening from AMD: maybe look into making ACO your primary target rather than LLVM, it is clearly a better design for your GPUs, it has substantially less overhead, and there's no legal reason it can't be a part of all of your drivers.

As for kernel support, it is often same-day or at least it can access the displays on launch day, provided you have the latest stable kernel. ArchLinux is rarely that far behind a new stable kernel release, so on ArchLinux, same-day support of one form or another, and full support that day or some day soon, is the norm.


Suspend / resume works fine with my Sapphire Pulse RX 5700 XT.


Is it really crap? I have it and it feels stable and the Crimson UI seems well made. It feels way better than the Catalyst days.


It is crap enough for me (an RX 5700 XT user) to keep a backup of the last few known-good drivers, so that when one inevitably breaks things I can roll back to a previous driver.

Some issues I've had with a variety of AMD drivers on my current PC, off the top of my head: turning on the monitor before the PC would cause the GPU to not realize there is a monitor attached; letting the monitor go into power-save mode would also cause the GPU to think the monitor was lost; settings for display scaling would be lost after every full reboot (full = real reboot, not the fast hibernate-based one Win10 does most of the time - you get a full reboot after updates, some installs, etc); random full system hangs when trying to play GPU-accelerated video (which is pretty much most video on the web, as well as some applications like Microsoft's new Xbox Games app); random reboots too; etc.

So I tend to be careful with updating the drivers. The last issue I had wasn't as bad as the random hangs/reboots (which fortunately haven't happened recently), but I simply couldn't launch the Crimson UI at all. I had to do a full reset and reinstall of the drivers for it to appear again.

In comparison, updating to the latest Nvidia driver when I had an Nvidia GPU (which was from the early 2000s until ~2 years ago) was basically a non-issue: I wouldn't even think twice about it, as I never had any issue.

And FWIW it was the same on Linux too: I never had issues with Nvidia's drivers there either, and performance was more or less the same (at least for OpenGL stuff). But note that I avoid stuff like Wayland, hybrid GPUs, etc like the plague.


> turning on the monitor before the PC would cause the GPU to not realize there is a monitor attached

I have a similar issue with a Dell display attached to an AMD card. After suspending the PC, the monitor does not detect the PC at the other end of the DP cable, except for Amazon Basic cables which work for some reason. Digital standards are weird.


I've had all the same problems with my recently bought DisplayPort Monitor (previous ones were all HDMI and worked flawlessly).

The fix for me was switching from Xorg to Wayland. Haven't had a problem since, apart from Steam not liking it all that much.


Interesting you mention this standby issue. I just moved a monitor from an nvidia setup where it had zero issues.

Now when I turn the laptop (with Radeon gfx) on, I have to turn the monitor off and on before it is recognised.


This back and forth in this thread about nuisances like that is one of the reasons I am definitely sticking to Intel integrated GPUs when running Linux. It's 2020 and stuff like that should be much smoother :-(.


Note that in my comment above I was referring to the Windows AMD driver. I haven't used Linux much with this machine (though when I did, it had a 50/50 chance of completely hanging the system, but I think this was an issue with the kernel and the then-new Zen APUs that was quickly fixed).


I have Lexa PRO in my workstation (Fedora) - Suspend/Resume works so far.

I have an issue though where switching off the monitor for a few days might make the AMD card disable the outputs and not recognize the monitor afterwards (I think it is related to the order in which I try to "wake" the monitor) - which I cannot recover from without rebooting the machine.

But this is with a machine never going into suspend or any sleep state - and I can't say if this would be the same with the NVIDIA card. I do not use the NVIDIA card for video output because the proprietary driver would regularly stop showing my desktop - or suddenly stop showing any output at all after a reboot.

The integrated Intel GPU on my laptop is mostly without issues whatsoever.

On laptops I would still recommend Intel GPUs anyway for power consumption reasons - although AMD APUs are quite interesting and I don't have recent knowledge about how well they compare. The CPU and its ability to lower power consumption under sleep is also relevant there, and this was way better under Intel so far. Unless you need the increase in performance an AMD GPU/APU would offer...


I have a similar issue with Nvidia on Linux. My larger display is slow to start, so I have to rerun xrandr after suspend in order to get it working.


I remember the Catalyst days. I used to work for a company which included a pc in the price when selling its software. We unofficially supported people who would run it on their own PC but eventually had to put our foot down and explicitly state that we wouldn't support AMD cards.


Hmm, that's a low bar, huh. Is AMD on Linux anywhere close to Nvidia on Windows?


The thing is, Nvidia also has issues, but their PR game is historically better. Many graphics developers have had experiences with Nvidia support where they run into a strange bug and are instructed to set a magic value to enable a driver hack. AMD drivers have had good and bad periods and hacks of their own, but are usually better behaved in this respect. But it's actually Intel that gets the most praise for adhering to spec, and therefore being a useful baseline.

So user perceptions and dev perceptions diverge on what makes the drivers good, and this has shifted with the different generations of APIs too; as we've gone towards a lower-level access model, the basic driver functionality has become less focused on performance hacks, but there is a lot of legacy code in there to support old games.

We're long past the worst period for Radeon on Linux which was back in the 2000's with "fglrx" - a driver that I never managed to get working. The new stuff will run with some competence.


I recently bought a RX 5700 XT, and installed it on a computer that first ran Linux and then Windows 10.

In Linux, the driver (including audio) seemed very robust, but I didn't find anything like a detailed control panel for the card's graphics features.

On Windows, the AMD-supplied control panel has plenty of knobs and buttons, but the driver itself seems less robust, particularly w.r.t. audio-over-HDMI.


That's very informative, thanks. I wonder if there's a cli utility on Linux instead...



Thanks for the links; I will definitely test those out


Sure. For some reason they aren't packaged yet in common distros, so that makes them not well known.


Maybe have a look at corectrl, which aims to create a beautiful control panel for graphics cards.


What about: is AMD on Linux anywhere close to Intel on Linux? No games, just 3D acceleration for the desktop, bug-free suspend and resume, etc.


Maybe I'm just lucky, but I have not had a single issue on Windows with my RX560. I know AMD/ATI drivers used to be horrible on Windows back in the day, but I really think they've gotten a lot better, I'd say on par with Nvidia's.


It is not. My 5500 XT is unusable when using 2 monitors. https://gitlab.freedesktop.org/drm/amd/-/issues/929

Apparently AMD doesn't have the resources to debug these millions of lines of code, since this has been open for a year now.

Yet people still say NVIDIA on Linux has issues. They don't support Wayland and tend to lag behind with Linux-only tech in general, but the driver itself is top notch. I haven't had an nv driver crash on Linux in 10 years. It's just the same echo chamber born of the famous moment of Linus flipping NVIDIA the bird.


My experience is 180° opposite to what you describe.

I never had a mentionable issue with AMD cards since switching to the open source driver approx. a decade ago. I have an NVIDIA 1060 card in my workstation for CUDA - every single time I put it into a running state again, I have a realistic chance of completely borking my system.

In fact I had an AMD card installed after the first two incidents, simply to have at least a chance of having working video output when the NVIDIA driver once again doesn't want to talk to the kernel.

That, and the whole practical implications and idealistic differences of having a (mostly) open source driver vs. a (mostly) closed source driver (I think we can agree that the open source NVIDIA driver is out of the discussion).

Obviously you might run into problems if you try to run very recent hardware right after availability. Kernel driver development is not ideal for cutting-edge hardware; some things might break, and it might take some time for your distro to ship the newest kernel/driver.


The Nvidia driver started supporting proper Optimus at the beginning of 2020 (it can run apps on the integrated and dedicated cards simultaneously). I use it regularly on my XPS 15 (to play Kerbal Space Program). It's called "DRI PRIME". You have to set an environment variable when starting an application saying which GPU you want it to run on.

I am, however, very much looking forward to the new AMD GPUs. Hopefully the RX 6000 series will be near a 3080 in more than the 3 hand picked games in their teaser. Would love to use Wayland on my desktop.


That's interesting, thanks, I tried to use Optimus on my XPS years ago but it wouldn't work. I'll try it now, thanks!


Search for “amdgpu ring gfx timeout”. There seem to be a whole class of bugs that have been open for years which not only haven’t been fixed, but there isn’t even any clear indication of what the root cause(s) is/are.


I tried a couple of different AMD cards, and my machine crashes on resume if I try to use either of them (but the Intel iGPU works fine).

Searching for amdgpu bug reports leads to:

https://amdgpu-install.readthedocs.io/en/latest/install-bugr...

which links to a page saying "Bugzilla is no longer in use" :-(

This is under Qubes/Xen, though, so maybe that causes extra problems. If any devs are reading, I did report it here in the end:

https://github.com/QubesOS/qubes-issues/issues/5459


It could be misbehaved applications. While AMDGPU and Mesa are much faster than the AMD proprietary driver (on some OpenGL workloads I have seen a 2x improvement compared to AMDGPU-PRO or the Windows driver) and are normally stable, I had several issues where bad shaders brought down the whole GPU (with "ring gfx timeout"). Things like out-of-bounds access or division by zero.


I upgraded from a Geforce 460GTX to a Radeon RX560, and I ran into two issues. Nothing major, and I've had worse issues with the Nvidia drivers, but they are still something to be aware of.

The first was that my distro (KDE Neon based on Ubuntu 18.04) shipped an older version of Mesa at the time, which was too old for the AMDGPU driver, so I had to add a PPA with an updated version. Since Neon updated to a 20.04 base, it works straight from a clean install. It also worked with no issues when I switched to openSUSE Leap 15.2.

The second was that DVI output was limited to single-link instead of dual-link. My monitor at the time only supported full 1440p through dual-link DVI or displayport, and the old GPU didn't have displayport. Buying a displayport cable was a quick fix, and I believe the DVI issue is fixed in the driver now.

Aside from those two minor hurdles, it has been smooth sailing, very good OpenGL performance in the games I play.


Not sure if this is a driver problem, but there's a LOT of general usability issues on AMDGPU + Linux. The default thermal control being an absolute catastrophe, for one.


How is it a catastrophe? I game every day on AMD on Linux and have no issues. 99.9% of consumers don’t care about overclocking so if that’s what you’re referring to I think it’s a non-issue.


It runs 75C at idle because the fan curves are wonky.


AMD has caught up to Intel but still lags behind nVidia (on Windows at least). I'm just not sure they can fight a two front war. Something has to give.


If we're talking about CPUs wrt Intel, and GPUs wrt nVidia, I think they'll do fine - IIRC, they're both separate internal groups with the same overall leader (Dr. Su).


Wait a few months after a new GPU comes out, maybe until the next major version cycle (like if you want a new card that comes out in November, wait until the 21.04 Ubuntu/PopOS release).

I bought my RX 5700XT shortly after release, and was using alpha/beta kernel releases and downloading extra files manually for several months after just to get it running; even then, an upgrade/update could turn into a blank screen on boot for me. It also broke out-of-the-box support for running full VMs, which was pretty painful for me as well, and I wasn't going down that rabbit hole to try and build it myself.

YMMV of course.. but that's just my take on it.. I bought specifically for Linux support, but took a few months to shake out.


Have you tried Nvidia on-demand option for Optimus?


It is good enough. I'd say overall Nvidia's driver is worse.


I have returned 2 Radeons that I bought for the specs, because the drivers were bad enough that I couldn’t get the same-clock performance as Nvidia or, worse, dealt with driver crashes and system reboots - note that this was between 10 and 20 years ago. I am seriously considering trying again at EOM when they announce the new cards, but the fact that it’s a Radeon is still a downside to me.

Most of the Linux community has a historical hatred of Nvidia because of the driver issue, so there’s a lot of relative love out in forums, but just “stable” would be a step up for me for Radeons on Windows.


I recently built a system based around a Ryzen 5 3600 CPU and Radeon RX 5600 XT GPU, and in both Windows 10 and Linux with a 5.4+ kernel it's rock solid. Gaming in Windows is simply amazing and it pairs well with my 1440p monitor. On Linux gaming is also extremely good, with only a couple of "Windows only" titles acting buggy under Proton/Steam. Considering Proton itself is in its infancy, that's to be expected.

With native performance on official Linux games on par with or better than the Windows equivalent, and more and more games getting Linux ports due to Vulkan, I just about have no need to boot into Windows at home anymore apart from Fusion 360.

As a workstation in Windows, since I don't overclock I don't see any stability issues. Fusion 360 is fast and fluid unlike my 8 year old Sandy Bridge dinosaur at work, even after adding a GT 1030. Good quality Crucial RAM and a no-frills AsRock B450 board make for a rock-solid build. Ditto on Linux as a workstation, everything just works and works well, and it's superb for 3D modeling and music creation (two of my main hobbies).


Good to hear that things have gotten better! Will be watching the oct 27 reveal of the new cards :)


I'm also very interested in giving my 2080 Ti to my partner (a Windows user) and getting the fastest next gen Radeon for myself.


It is not on par at all - my 5600 got annihilated by driver issues.

AMD has incredible CPUs, but just buy an Nvidia GPU - especially if you are using linux.


Nvidia has subpar support for Wayland on Linux because it uses its own EGLStreams buffer API instead of the standard GBM buffer API, which is better-supported. Both AMD and Intel use GBM.

Also, the open source driver for Nvidia (nouveau) has incredibly poor performance compared to Nvidia's proprietary driver, and lacks essential features such as reclocking for recent hardware generations:

https://nouveau.freedesktop.org/PowerManagement.html

AMD's and Intel's open source drivers are their primary offerings on Linux and have good performance across all hardware generations.


Intel has actually gone downhill lately, especially for prior generations. I've had to live with 5 or so years of tearing with multi-monitor support on Ivy Bridge, and even single monitor tears inexplicably with some software (that shouldn't). The Intel Xorg driver is unmaintained and the generic modesetting driver doesn't work quite as well. When I first got my Ivy Bridge system, triple head mode didn't work for a while either, so it's not like they have great support when the hardware is current either.

I've switched to AMD now and things are much better. Go with AMD.


The Xorg modesetting driver works quite reliably on Intel in my experience.

The SNA acceleration architecture in the Intel Xorg driver was a disaster in terms of correctness and stability. When SNA appeared as an option it initially seemed quite fast, but didn't take long to reveal it was also quite broken vs. UXA.

I used to explicitly use UXA but for the last 5-10 years simply using modesetting has been the way to go.

Personally I think you're conflating Xorg and kernel driver issues. Xorg is basically unmaintained in general now and unfortunately SNA was the last major development in that context for the Intel driver, and it was not good.


This doesn't apply if you want to run CUDA-dependent software. I've generally gone for Nvidia for my personal machine since Torch has behaved oddly on AMD cards in the past.

It's true that Nvidia doesn't support Wayland properly, but that's not really an issue in my opinion. Wayland still has its own problems that mean switching from X11 isn't viable yet.


Although your argument is valid, are we talking about CUDA? Obviously CUDA is an NVIDIA thing under all platforms, right? I don't think anyone would buy AMD with the intention of running CUDA.

Regarding GPUs and how good they work under Linux, computing on GPUs is only a part of the discussion I would argue...


What issues have you had with Wayland? Switching to it has given me a tear-free experience on both AMD & Intel laptops; besides that, it performs similarly to X11.


> tear free

> 5 or so years of tearing

I know what people are referring to, but a less geeky person might come away from this thinking people get very emotional about bad Linux graphics drivers.


My main problem with it is limited software support. Xmonad isn't available and as far as I can tell what support exists for screen recording and screenshots is half-baked at best. I haven't seen anywhere near enough problems with X11 to make switching window managers worth it, and the screen recording thing would be a massive pain to work around.


I'm still on an Intel system (skylake) and my experience is similar to yours. 5+ years of bugs and crashes, tearing, multi-monitor headaches and general instability.

Eagerly awaiting the new AMD hardware.


I've found the wayland server to be a great experience with intel - the only weird bits I've seen are full-screen noise on firefox and poor support for high dpi, the latter of which is even shittier under X11. The server is really very usable nowadays.

AMD's ok if you have the room for the discrete card, but I wish they would invest more in integrated on-board chips.


Modern AMD GPUs work better on linux than nvidia. No tearing, multi-monitor works, and vulkan is very smooth. Nvidia is actually less stable, and has some peculiar quirks, such as needing composite manager running to get rid of tearing, spotty multi-monitor support, etc..


You are dismissing people saying they ARE having issues with AMD on Linux. In fact my AMD card does not do multi-monitor, and in this thread I'm not the only one that has multi-monitor issues on AMD.


Which card are you using? I'm aware the older cards are still bad, especially if you still need fglrx. In my personal experience, modern AMD GPUs on linux are the first time graphics have worked reasonably well on linux for me. Even intel drivers are riddled with bugs and instability (not to mention they still don't even do gallium), the GMA 3650 (PowerVR SGX based) being the most infamous, worst driver ever.


A 5500 XT bought in June, so not old at all. I've heard the opposing argument, that since it's a relatively new card (out since Dec 2019?) I should expect some bugs, which is insane one year later. It's actually unusable, I have to log into my machine via SSH to restart it, or force reboot. It might break after 30 minutes or 3 days, when idle or busy.

https://gitlab.freedesktop.org/drm/amd/-/issues/929

AMD developers in that thread are chasing their tails and still haven't figured out why so many cards are having issues, and why others aren't, but as a consumer, that's really not inspiring at all.


Funny, I have 5600 XT (Sapphire Pulse) and it runs like dream. The out of box experience with Linux has been very good. Note that some of the aftermarket cards are actually bad and the instability might not be software related. Before 5600 XT, I used R9 290, and while it did require some tweaks to enable all features (due to being older card), it still ran relatively stable and in general was better experience than any nvidia card I had used in past.


This guy is having the same issues I'm having with a 5600. Multi-monitor, entirely new computer built a couple months ago.

Randomly locks up, random black screen, random rainbow colors all over my monitors.

With my new Nvidia 2060 which I bought to replace it; nothing. No issues. Works just fine on Manjaro.

For whatever reason, the AMD cards just get clapped on Linux.


My experience with linux is that the nvidia drivers and support are the worst of the bunch, and if I had a nickel for every time I could trace a kernel panic through their driver I'd get a very nice lunch. Their popularity seems to be driven primarily by exclusive access to CUDA APIs and windows gaming. Nouveau is OK for accelerated 2d but is hardly in the same ballpark as the AMD drivers.

That said I just picked up a quadro (not my choice, came with a prebuilt NUC) and I've been pleased to find that it "just works" on freebsd (I use it to realtime transcode video), so clearly great experiences are possible and I don't want to be needlessly harsh.

Personally, I'm dying for a discrete intel card. I can't recall any hiccups with intel chipsets, ever, and that matters WAY more to me than raw performance.


> the driver is pretty abominable compared to the code quality of most of the rest of the kernel.

Could you say more about what specifically makes the driver abominable? Is it just those files with largely duplicated code?


duplication 3 times with small differences in between is a good case to keep them separate imo.

abstraction is one of the main sources of code complexity.

you start with one function used in 3 places, then add boolean args to it to get slightly different functionality at each place, and eventually it becomes a mess of complexity
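A hedged illustration of that failure mode (all names below are invented, not from any real driver): the shared helper quietly grows one flag per caller until no single caller's path is readable anymore.

    #include <stdbool.h>

    /* What started as one shared setup function tends to drift into this
     * as each of the three call sites needs a slight variation. */
    static void setup_ring(int ring_id,
                           bool is_compute,
                           bool skip_doorbell,
                           bool legacy_wptr)
    {
        (void)ring_id;
        /* every flag adds another branch, and each branch only makes
         * sense to the one caller that introduced it */
        if (is_compute && !legacy_wptr) { /* ... */ }
        if (skip_doorbell)              { /* ... */ }
    }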


I think that's very subjective and situational.

The amdgpu driver has duplicated files for different versions of things, so it'll have thing_v6.c and thing_v7.c and thing_v8.c with a lot of duplicated functions.

The more common way of doing something like this would be to have structs of function pointers that get populated based on what version of GPU you have. You have one file with all the common functions that they can share; in the definitions for each GPU version you set the majority of the function pointers to the common version they all share, and for the ones that have to be different, you set them to their unique version. That way you can define all the common functions once and point to them in the structs for each version.
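A minimal sketch of that ops-table pattern (the struct and function names here are invented for illustration, not the real amdgpu ones):

    /* Shared hooks are defined exactly once... */
    struct ring_funcs {
        int  (*init)(void *dev);
        void (*emit_fence)(void *dev, unsigned seq);
    };

    static int common_ring_init(void *dev)
    {
        (void)dev;
        return 0;
    }

    /* ...and only hooks that genuinely differ get a per-generation version. */
    static void gen7_emit_fence(void *dev, unsigned seq) { (void)dev; (void)seq; }
    static void gen8_emit_fence(void *dev, unsigned seq) { (void)dev; (void)seq; }

    static const struct ring_funcs gen7_ring_funcs = {
        .init       = common_ring_init,   /* shared */
        .emit_fence = gen7_emit_fence,
    };

    static const struct ring_funcs gen8_ring_funcs = {
        .init       = common_ring_init,   /* shared */
        .emit_fence = gen8_emit_fence,    /* the only hook that differs */
    };

In this sketch the driver would pick the right table at probe time based on the detected GPU generation, and all the common code stays in one place.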

Having a quick flick through the code now, they do use structs of function pointers in each version for common operations but they still don't abstract out the ones that are either identical or have very few differences that you could special case.

Refactoring such a giant driver for no performance gain is going to be extremely low on AMD's todo list, so it'll probably stay like that. It just doesn't look like anything else in the kernel.


This is literally what everyone does in embedded C land. The repetitive definitions are generally intended to be used with macros and are typically generated from the same definitions as the chip registers themselves. Some places also auto-generate embedded C/C++ structs or classes, which imo is better. But I have gotten quite a bit of pushback for doing it.

A big issue is also the use of bitfields, as much as reg duplication. Bitfields in C/C++ are a minefield if you don't lock down a known-good compiler version, because there's just so much about them that's technically unspecified. Oftentimes you'll also have issues where certain register fields exist for some registers of a series and not the next, or where the functionality/sizing/interpretation is context-dependent, or where certain locks or write orders are needed for correct access; these are often handled with presence-checking macros.

IMO, if we want better driver code, it's time for GCC/Clang to nail down the bitfield layouts for the embedded use cases. This has been broken for far too long.
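A sketch of the kind of register overlay being discussed and why it's fragile (the register and field names are made up): the C standard leaves bit-field allocation order and padding implementation-defined, so the same struct can map to different bits under a different compiler or ABI, which is why so much code falls back to explicit shift/mask defines.

    #include <stdint.h>

    /* Hypothetical 32-bit register overlay; field names are invented.
     * Which end 'enable' lands on, and how fields pack into storage
     * units, is implementation-defined in C - hence the need to pin a
     * known-good compiler when this style is used for real hardware. */
    union clk_ctrl_reg {
        uint32_t raw;
        struct {
            uint32_t enable     : 1;
            uint32_t divider    : 6;
            uint32_t reserved_0 : 9;
            uint32_t source_sel : 4;
            uint32_t reserved_1 : 12;
        } f;
    };

    /* The portable fallback: explicit shift/mask macros, which is
     * exactly the verbose "reg duplication" style being criticized. */
    #define CLK_CTRL_ENABLE_MASK   0x00000001u
    #define CLK_CTRL_DIVIDER_SHIFT 1
    #define CLK_CTRL_DIVIDER_MASK  0x0000007Eu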


Sounds like an excellent way for someone looking for something to contribute to get their code into the kernel, though.


It would be very difficult to get accepted. You'd have to get the AMDGPU driver maintainers on board, and you'd probably have to do a lot of it at once to justify the change. It would also take some discussion, and you're talking about refactoring a lot of stuff which probably moves underneath you during this, so you have to keep iterating to keep up with the changes, all without knowing if they'll even end up taking it...

Changes like this are probably a good way to get started but I would guess the AMDGPU driver is one of the worst places to get started as a beginner.


I mean, each new version is separate, correct? So the only change that can happen under you is when something is backported. How often does that happen for a gpu driver, and how far back does that go?


Or you duplicate code in 3 places, and apply the same fixes or updates in 3 places for all of eternity. There are pros and cons to both methods and each have their places, no need to start this constant debate here.


That's why this approach can tend to be a positive for driver versions matched to hardware iterations: a given fix may or may not apply to a given hardware config, and likely has to be tested against each config separately.

It's one of the unusual circumstances where, unfortunately, abstraction can decrease flexibility and increase development time.


Proverb: "A little copying is better than a little dependency." (Rob Pike)

That is, it's better to have duplications than the wrong abstraction. This may also be in reference to C compilation, in that loading header files and dependencies costs more than inlined code. That's one of the goals that the Go language sought to resolve, anyway.


> Though as reported previously, much of the AMDGPU driver code base is so large because of auto-generated header files for GPU registers, etc. In fact, 1.79 million lines as of Linux 5.9 for AMDGPU is simply header files that are predominantly auto-generated. It's 366k lines of the 2.71 million lines of code that is actual C code.


Why not generate it during the build? Is there a good reason not to do that?


It was generated by the hardware division. These are the registers that are authorized for disclosure in the open-source driver by the AMD employed open-source driver developers.

So, it includes many times more register definitions than are ever used (consider that there are 8x more register definition lines than actual code lines that could use them), and it includes many sets of 16 or 64 definitions where a software developer would have made one parameterized definition (all the same except for _00, _01, _02, _03, etc). But this is exactly what the hardware guys generated for public release, and it is to be used as-is.
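To illustrate the difference in style (the register names and offsets below are made up, not real AMD values): the generated headers spell out every instance of a repeated register, where a driver developer would normally write one parameterized macro.

    /* Auto-generated style: one line per instance (values invented). */
    #define mmFOO_ENGINE0_CTRL 0x1000
    #define mmFOO_ENGINE1_CTRL 0x1010
    #define mmFOO_ENGINE2_CTRL 0x1020
    #define mmFOO_ENGINE3_CTRL 0x1030
    /* ... repeated for however many instances the hardware has ... */

    /* Hand-written equivalent: one base, one stride, one accessor. */
    #define mmFOO_CTRL_BASE   0x1000
    #define mmFOO_CTRL_STRIDE 0x0010
    #define mmFOO_CTRL(n)     (mmFOO_CTRL_BASE + (n) * mmFOO_CTRL_STRIDE)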

IMHO it's kinda annoying and sad. The rest of the kernel is held to a higher standard; that's why all the other non-trivial multi-arch, multi-family, multi-generation code in the Linux kernel is much more concise / less sloppy. It takes a lot of effort to make it that way, and commercial companies pretty much never bother, except when required by the Linux maintainers.

But, modern graphics drivers are way too complex and way too much work, and most people do want some proper modern GPU support in the kernel, so compromises have to be made. It's not too bad, just a bunch of inert header lines, git and the compiler handle them just fine I guess.


Isn't that the beauty of open source, when if someone has severe OCD they could just spend their time tidying up the kernel driver instead of watching mind numbing telly?


I'm not sure what the score is - if these things were tidied up, would AMD still be able to upstream their own changes, or do they take back fixes from kernel devs? Seems like a complex political process...


> It was generated by the hardware division. These are the registers that are authorized for disclosure in the open-source driver by the AMD employed open-source driver developers.

...which is arguably not compatible with the GPL:

"The source code for a work means the preferred form of the work for making modifications to it."


It's not applicable, in practice.

This is the published hardware interface for the driver, the formal public contract. You can't change it without changing the hardware itself.

If you really want to run the generator... well, the preferred form for modification is open to interpretation and if it's some proprietary tool then just getting the output is preferable to a dependency. Sometimes the rabbit hole is too deep, and we have to draw a line.


> ...which is arguably not compatible with the GPL:

From what I can tell, most if not all of the driver is licensed with an MIT-style license. But even if it was GPL, AMD would be the licensor, so it gets to decide the “preferred form of the work”.


"Preferred form for modification" is a form that is suitable for a skilled stranger to modify it with little exposure.


What I meant is that the copyright owner is not bound by the terms of a GPL license he grants to others. Similarly, a licensee who receives software from the copyright owner under a GPL license cannot compel the copyright owner to do anything.


An author that licenses the software under the GPL but does not release the source code in that form cannot legally incorporate outside contributions into his GPL'd work, as he would be in a position of infringing the derivative work author's rights.

> a licensee who receives ... GPL license cannot compel the copyright owner to do anything.

Unless the licensee in question has also contributed to a published revision of the original licensor's code. And for that to work (remember the wording "preferred form for modification"), you need a form suitable for modification by a skilled stranger with little prior exposure to said work. You would otherwise get a different preferred form of modification for each contributor, which is unworkable.


> you need a form suitable for modification by a skilled stranger with little prior exposure to said work

That’s a nice idea, but it’s not a condition of the GPL. GPL v2 and v3 both only state, “The ‘source code’ for a work means the preferred form of the work for making modifications to it.” That definition exists because without it a licensee might try to argue that distribution of modified and then obfuscated code satisfies the source code offer condition.

Regarding a project licensed to others under the GPL, if the project owner accepts contributions under the GPL, then he becomes a licensee of the contributions. So, as you pointed out, he would need to meet the “preferred form” clause and other terms, at least as regards to the contributed portions. As you might expect, for a substantial project with many contributors, this could become very complicated. Therefore, many projects require contributions be made under a more liberal license (or even a copyright assignment) that allows the contribution to be sub-licensed to others without conditions.


> Therefore, many projects require contributions be made under a more liberal license (or even a copyright assignment) that allows the contribution to be sub-licensed to others without conditions.

Most, but not all, European jurisdictions have a legal stipulation that all copyright assignments are either void or revocable even if the assigner says otherwise, except for work-for-hire. You therefore cannot release yourself from the preferred-form requirement even by requiring a copyright assignment; otherwise you will get stuck in the case where any further published modifications to your work - not only the contributions, but any parts that those modifications interact with so much that they are inseparable, even by the original licensor - may become illegal overnight. As the GPL does not say "the form deemed preferred for modifications by the licensor(s)" but "preferred form ... for modifications", you need to apply the objective definition I stated above. It would be nice if they had explicitly stated it that way, though, relieving a lot of load from judges in resolving a possible dispute over which forms are preferable for modification and which are not.


It may help to think about who can sue whom. Generally only a copyright owner can sue an infringer. A license operates as a defense against a claim of infringement. If a licensee fails to meet a condition, then the license is invalid.

So, in the case of the project owner who (1) starts out owning all of the rights to the project, (2) incorporates code licensed from a contributor and, (3) distributes the combination, the only person who could possibly sue the project owner for copyright infringement is the contributor. The claim would only pertain to the contributor's code, because that is the only part he owns the copyright to. The project owner/defendant would raise the license as a defense and the key question would shift to whether the owner/defendant violated any of the conditions of the license.

Where the license is the GPL, one of the conditions is partially affected by the "preferred form" definition of source code. The court would look at what the owner/defendant did and whether he met that condition. Importantly, the condition and "preferred form" definition would only be considered in relation to the plaintiff's code; the owner/defendant's code wouldn't be relevant.

Regarding the contributor's code being "inseparable", that will not be the case for one very simple reason: If the contributor sues the project owner, then he must identify which portion of the code he is suing about. If he can't do that or can't show ownership of it, then he will lose.


> license operates as a defense against a claim of infringement

It works like that in fully assignable IP jurisdictions (like USA), but it works like a contract of adhesion in the author's compulsory rights jurisdictions (like Germany and Czechia).

What I meant by an inseparable contribution was a significant contribution that, when eliminated, would make the entire work no longer resemble its current state; i.e. the line that tells a derivative work apart from near-equal co-authorship (the two are treated similarly in fully assignable IP jurisdictions, yet have entirely different regimes in the compulsory rights jurisdictions). Not the entirety of the work, indeed.

> the condition and "preferred form" definition would only be considered in relation to the plaintiff's code; the owner/defendant's code wouldn't be relevant.

It would, in a compulsory rights jurisdiction, because all copyright assignments are either void or revocable at will in such jurisdictions.


> It would, in a compulsory rights jurisdiction, because all copyright assignments are either void or revocable at will in such jurisdictions.

I didn't believe this, so I looked at a study of EU copyright law[0]. Rights of authors are split into moral rights and economic rights. Economic rights are transferable as property. Moral rights, however, inure to the author and are inalienable. In some countries, the moral rights include the right to withdraw the work from circulation. This right to withdraw is probably what you are referring to when you say that copyright assignments are void or revocable.

The right to withdraw a work from circulation, however, does not come for free. In Spain it is only, "after indemnification of the holders of exploitation rights for damages and prejudice."[1] In Estonia, "The rights ... shall be exercised at the expense of the author and the author is required to compensate for damage caused to the person who used the work."[2] In France, "... he may only exercise that right on the condition that he indemnify the assignee beforehand for any prejudice the reconsideration or withdrawal may cause him."[3] In Romania, the right is "subject to indemnification of any holder of exploitation rights who might be prejudiced by the exercise of the said withdrawal right."[4]

In all of the examples I could find, the withdrawal right essentially extinguishes an assignment of the economic rights. So, in a sense you are correct that an assignment is revocable. Practically, however, the author who exercises that right would be liable for damages to the assignee, which could be significant, and the author would not be able to exercise the right if he could not pay for the economic harm.

Anyway, this has been interesting and I learned something about European copyright regimes. Thanks.

[0] https://www.europarl.europa.eu/RegData/etudes/STUD/2018/6251...

[1] Id. at 134.

[2] Id. at 93.

[3] Id. at 173.

[4] Id. at 301.


What modifications would you make that might be useful? The (proprietary) hardware isn't going to change.


If the generated code is a representation of certain unchangeable data about the hardware, you might still want to

1) represent it more compactly;

2) represent it in a form that can more easily be read and transformed to handle future use-cases for the data;

3) after some future restructuring of the driver, represent the data in a form that better fits with that structure.

If you have to regenerate the code using the proprietary tool in order to restructure the driver, the generated code is not "the preferred form of the work for making modifications".
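
To make that concrete, here is a minimal, purely hypothetical sketch in C of the data-plus-generator arrangement being assumed here: a small, human-editable table of register descriptions and a generator that prints the header the driver actually compiles against. The names and offsets (FOO_CTRL and friends, the mmHYPOTHETICAL_ prefix) are made up for illustration, not real amdgpu definitions.

    /* Hypothetical generator: a compact, editable table of register
     * descriptions, emitted as the big header the driver includes. */
    #include <stdio.h>

    struct reg_desc {
        const char *name;    /* register name */
        unsigned    offset;  /* byte offset from the block base */
    };

    /* The arguably "preferred form": the thing a human actually edits. */
    static const struct reg_desc regs[] = {
        { "FOO_CTRL",   0x0000 },
        { "FOO_STATUS", 0x0004 },
        { "FOO_FIFO",   0x0010 },
    };

    int main(void)
    {
        for (size_t i = 0; i < sizeof(regs) / sizeof(regs[0]); i++)
            printf("#define mmHYPOTHETICAL_%s 0x%04x\n",
                   regs[i].name, regs[i].offset);
        return 0;
    }

Renaming everything, adding a prefix, or emitting the data in another language's syntax means touching the table or the printf format string, not hand-editing thousands of emitted #define lines - which is the crux of the "preferred form" argument.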


All you're going to end up doing is changing the names. And for that, in my view, a big long list of defines (or whatever), autogenerated or not, is as good a form of the work as any other.

And, besides, there is an excellent chance that you will never end up changing the names.


You might want to port it to a new language, in which case having the hardware description and a generator tool is easier and better than converting the C headers.

And yeah, sure, pragmatically it might not make much of a difference in this specific case, but if the AMD devs were to port their driver to a new language they wouldn't edit the C headers; they would certainly just update their generator. So the preferred form for modification is clearly not the generated C headers.

Not to mention that if all you wanted to do was change the names, maybe prefix them with something, editing the generator is _still_ clearly the preferred form for making that change.


But the GPL applies to the driver they released, not some hypothetical driver that you or somebody else might create in the future. You're already going to have to rewrite it all in this proposed alternative language... this header is the least of your worries.

Strikes me that AMD have supplied everything required: all the driver code in the preferred form for modification of the driver, i.e., a bunch of C files.

Some of these C files are a big long list of slightly opaque magic number defines that relate to the hardware, perhaps generated by some unreleased tool, who can say - it's all speculation at this point - but that's OK! The hardware is not the bit you're going to modify. As far as the people modifying the drivers are concerned, those numbers are never going to change. This portion of the driver is fixed.


You fundamentally can't, because the defining code is usually hardcore proprietary or a proprietary toolchain artifact from Cadence/Synopsys. We're talking something like a memory map of the entire system, or 1MB+ XML blobs.

Honestly lifting it from a header file manually is going to be easier for everyone.


What do you think AMD would do if they decided to port their driver to a new language? Would they update their generator or would they copy and edit the existing header?

They sure as shit didn't type out these header files by hand, so clearly these are not the "preferred form" for modification.


Remove some bugs, or improve its performance. Hardware drivers get updated all the time even when the hardware remains the same.

I'm not an open-source absolutist: I think the pragmatic solution Linux went with is good here. But it's silly to suggest that the driver couldn't be improved if it were more open.


The topic is not the driver - it’s the definition of the lowest level hardware interface.

It’s lists of registers and stuff like that; not things that can really be fixed by external devs.


We can say that it's generated from the "hardware schematics". AMD hardware isn't open-source hardware.


These are the registers that are authorized for disclosure in the open-source driver by the AMD employed open-source driver developers.

In other words, there's more functionality that they're keeping secret? Sounds like a challenge...

Edit: so the hacker spirit is not welcome here...?


Modern high-end processors have a lot of undocumented features. This is rather widely known, though of course not universally known. These have existed for a long time - https://en.wikipedia.org/wiki/Illegal_opcode .

And ... you know this. Checking your comment history https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que... I easily found https://news.ycombinator.com/item?id=8834863

> Intel CPUs have had undocumented features since their introduction; it's not hard to imagine their chipsets do too.

Before I did the search I thought you were one of the 10,000 (https://xkcd.com/1053/ ), surprised that others didn't share your enthusiasm. Now I don't understand your surprise.

You must surely know your comment about "the hacker spirit is not welcome here" comes across like snobbish gatekeeping, yes? At the very least, the implied lack of knowledge about undocumented features makes it seem that you aren't one to judge what the hacker spirit might be. ... which cannot be correct given your posting history.


What you're describing sounds like a binary blob masquerading as C header files.


It's just a list of register names, offsets, field names, and bit assignments. It is nothing like a binary blob. It is GPU documentation in the form of C header files.

Now I happen to know that the vast majority of it is shared between GPU generations to some extent, so someone could abstract things out manually to remove duplication, but it's a huge task.
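
For anyone who hasn't looked at these files, they read roughly like the following (made-up block name, offsets and masks, not actual amdgpu definitions): a register offset per name, plus shift/mask pairs for each field within the register.

    /* Illustrative only - not real amdgpu register definitions. */
    #define mmEXAMPLE_BLOCK_CTRL                 0x01a0
    #define mmEXAMPLE_BLOCK_STATUS               0x01a4

    /* Field layout of EXAMPLE_BLOCK_CTRL */
    #define EXAMPLE_BLOCK_CTRL__ENABLE__SHIFT    0x0
    #define EXAMPLE_BLOCK_CTRL__ENABLE_MASK      0x00000001
    #define EXAMPLE_BLOCK_CTRL__MODE__SHIFT      0x1
    #define EXAMPLE_BLOCK_CTRL__MODE_MASK        0x00000006

Multiply that shape by every IP block and every GPU generation and you get most of the header line count - plain data, nothing executable hiding in it.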


> Why not generate it during the build?

If those headers aren't expected to change then, with regards to accountability, it's far better to have the code checked into the version control and processed as is.

More importantly, if the code is already generated then there's no need to make the build system more brittle by adding a non-standard build target that depends on custom/third-party tools.


The Linux kernel doesn't play the Firefox game of requiring Ruby, Python3, Python2, NodeJS and Rust to even be able to build the thing.


Not for the build itself, but there are Perl and Python scripts in the kernel source, which are referenced by the kernel's main Makefile.


If you don’t want to use languages that enable code reuse, then you can’t complain about repeated code


>Why not generate it during the build? Is there a good reason not to do that?

In many hardware shops the C definitions for the visible registers are generated automatically from the hardware's source code


and I should add ... chances are the linux kernel is not the primary user of these addresses, more likely it's the internal DV ('Design Verification' - hardware QA/testing) teams who need access to all those internal debug/setup/etc registers that are not normally architecturally visible to the downstream software teams (like Linux/etc)


Wait hardware has source code? I’m just a web dev so I’m not aware of this. What does the code look like?


Yes, there is "source code" for describing hardware. Here are two you can take a look at:

VHDL: https://en.wikipedia.org/wiki/Vhdl

Verilog: https://en.wikipedia.org/wiki/Verilog


Depends on what you're building. Boards tend to be done as netlists (essentially a list of components and wires and how they are connected), but digital chips of more than a few dozen gates are normally written in a high-level language (linked to by other posters here), which can be both compiled into machine code that can be simulated and synthesised into gates and wires (a netlist) that can be laid out onto a chip.


I'll tell you more, hardware is often simulated in software from those source descriptions before going to any sort of production. Which is probably one of the reasons for the existence of these definition languages.


If it never changes, generating the headers during the build is just wasted time for whoever/whatever is running that build.


The real shame is that Nvidia is still doing binary blob drivers 15 years after I started caring about Linux. Are they really that afraid of someone taking their Lucky Charms?


My new theory is the Nvidia driver can't be GPL and in the Linux kernel, because then they couldn't ban datacenter usage of their GeForce cards by not licensing the driver for datacenter use. The upcharge on the Tesla series of cards is huge compared to GeForce for mostly the same chips. (For those not aware, see if you can find a GTX 2080 or 3080 from a cloud provider. It's not a thing. This is actually a huge deal for the machine learning industry, massively increasing costs. I doubt Google would have made the TPU if not for this.)

Also, their driver is very complex, and they are constantly improving their hardware. They don't want to be dependent on getting new features and performance improvements upstreamed.


Don't forget the fact that most of their silicon is basically the same and you can easily change it with some hardware/software mods[1] --- I think they have tried to lock that down a bit more, but ultimately it's a cat-and-mouse game and the only ones who win are those willing to ignore the insanity of Imaginary Property laws and take matters into their own hands.

[1] https://www.eevblog.com/forum/general-computing/hacking-nvid...


AMD does the same thing, and so does Intel; this applies to CPUs too. Silicon yield means there's a probability that some transistors won't work, so they disable those cores and create lower-end models. Sometimes, to meet demand, they simply disable working cores, as it's also cheaper to have one process. Tesla does the same thing with their cars, funnily enough.


IBM did it for their mainframes back in the day.


They still do to this day.


I've spent a lot of time trying to come up with a better term for these laws, and I think your "Imaginary Property" phrase here is better than anything I've come up with. Thanks!


> not licensing the driver for datacenter use.

Why are these kinds of licenses even allowed? If I buy a product, surely I can do with it as I please?

Also, why doesn't TSMC slap a license on every IC that leaves their fab, taking (say) a 30% profit from every application in which their ICs are being used?


Sure, you can do anything with the hardware which you actually have bought!

The problem is in the software (the driver) which you never can buy, only license under a long list of conditions which prohibit specific uses.

If e.g. Nouveau could implement the interfaces needed for CUDA, you could probably try to use a 3050 in a datacenter. I bet NVidia has provisions against this turn of events, too.


> The problem is in the software (the driver) which you never can buy, only license under a long list of conditions which prohibit specific uses.

Ok, so who gave software a special status over hardware? Is this desirable? Can we reverse it?


> Ok, so who gave software a special status over hardware?

Software is rarely sold (outside of bespoke development). All the off the shelf software is essentially rented.

Software itself has no legal value - the copyright is what is considered to be property. That property can be leased or sold. This is why copyright infringement is called infringement and not theft.

When you “buy” software, you are actually entering into a lease contract to use the software (sometimes perpetual, but increasingly only temporary) which can have various terms and conditions (that you really should read, but never do). But that lease doesn’t grant you the copyright.


I think that's misleading then, because when I buy a GPU, they make me believe I own it, when, apparently, in reality I don't.

I don't think this way of selling (or as you say renting) stuff should be considered legal.


I think your idea is agreeable but if we did treat hardware this way it changes almost everything. Apple/Nintendo/Sony/etc would all be required to give users root access to the software and remove their ToS.

And then it gets even more complex when you get into online services. Game consoles are going online-only next gen. If you buy the PS5 Digital Edition, mod your OS, and Sony bans you from their servers, your console is now a brick. But in many cases it's fair to be banned, such as banning cheaters.


You own it, and can talk to it the same way you can talk to a brick, lol :D

(I avoid nvidia whenever possible)


What's the point of that? We do the same thing in software all the time. You get basic functionality for one price, and pay for a key to unlock extra features. Why should hardware be any different? So the law would somehow require any feature on a hardware product to have some physical difference and not be purely a software limitation? What is the advantage of that? Just increases cost to the manufacturer (which will get passed down), then also precludes any possibility of upgrades by purchasing a software patch.


It should be clearly communicated and never be misleading.


What's misleading? You get the functionality you pay for. The fact that software controls that is an implementation detail.


As mr_toad says, the software is essentially rented.

This means I'm not buying but renting, which is not how it is advertised.


> I think that's misleading then, because when I buy a GPU, they make me believe I own it, when, apparently, in reality I don't.

If you buy a GPU you own it and the copy of the software it came with. You are free to use that combination as you choose, forever.

It’s not renting because you don’t have to pay rent to continue to use it. There may be software license restrictions, typically against modifying or reverse-engineering the software. However, it is an error to say that those license restrictions convert your ownership into anything like a rental agreement.

Some digital activists say that we don’t really own the devices that we buy because of license restrictions or restricted device firmware. It’s hyperbole. We do own our devices and the copies of the software they came with, even if they came with artificial limitations.


Let's test this idea of ownership: my phone auto-updates, and the manufacturer prevents me from reverting updates. One update has removed my ability to record my calls.

Does that sound like ownership? Can a BMW employee pop over to your garage one day and remove some bits of the car he thinks you shouldn't have any more?


The problem of features being changed or removed by a software update is real and the owner can be harmed, as you were. As the owner, however, if you are harmed in that way then you may have a claim against the manufacturer. For example, in a recent class action case by PlayStation 3 owners against Sony over the removal of the Linux OS feature, the court seemed to agree that owners were entitled to damages because Sony ended up paying millions of dollars to class members in a settlement. If you or the PlayStation 3 owners were not owners, then you wouldn’t have a good claim.


By the sounds of it, PlayStation 'owners' were paid compensation but could not get the Linux feature back; in other words, they were not made whole. They don't control what is happening to their property, and without Sony's agreement they cannot repair damage done by Sony to it.

That does not sound like ownership to me - again, think back to car ownership. Firstly, tampering with your car would have been criminal damage.

Secondly, BMW does not get a say in how you use your car. They can't stop you going over the speed limit. You could get your car fixed without having to involve BMW or going to court to force their hand.

In my view this Sony case looks like compensation for breach of a lease-like contract.


> By the sounds of it playstation 'owners' were paid compensation, but could not get the Linux feature back, in other words they were not made whole.

Members of the class could opt out of the settlement and sue Sony individually. A court could theoretically enjoin Sony to restore the feature for those individual plaintiffs, but the plaintiffs would have to show that monetary damages would be insufficient. Generally courts don’t like to force defendants to do things when paying money would be an acceptable outcome.

> In my view this Sony case looks like compensation for breach of a lease-like contract.

I haven’t read the complaint in that case but the plaintiffs probably alleged a breach of the implied covenant of good faith and fair dealing. So, yes, possibly a breach of contract claim but not a lease. (Note: A lease is a specific form of contract in which a lessor transfers possession of property to a lessee, but retains a future interest in the property after the contract term ends.)


Your idea of ownership is way too primitive and doesn't reflect reality.

You do NOT own the software that comes with your GPU!

Ownership implies the ability to transfer, modify, and resell, none of which are within the rights granted by the license of said software.

It's not "rental" either - it's licensing. You don't have to become a lawyer, but knowing and understanding the difference between proprietorship (ownership) and possession is a good start. Same goes for renting vs. licensing vs. ownership.

TL;DR you do not have ownership of any software that came with any device you bought and it's not hyperbole at all.


> You do NOT own the software that comes with your GPU!

When you purchase a consumer GPU that comes with software, you acquire the GPU, the copy of the software it came with, and a license to use the software subject to particular terms and conditions. That is what you own, no more, no less.


You do own it. And you're free to use the open source drivers if you want.


> When you “buy” software, you are actually entering into a lease contract to use the software

This is inaccurate, at least as to purchased software. A license is not a contract, because the licensee is not required to do anything. A license can have conditions (restrictions), but not covenants (promises to do something). A license basically functions as a defense against a claim of infringement.

Note: For purchased software there is a contract for the sale of the software subject to the license, but that shouldn’t be confused with the license itself.


> For purchased software there is a contract for the sale of the software subject to the license, but that shouldn’t be confused with the license itself.

That's simply not true. You are indeed making a contract for the sale of the license itself. Otherwise subscription models wouldn't work, and you would even be legally allowed to share and resell the software, which you aren't (i.e. just because it's possible to resell an acquired license while keeping a working copy doesn't make it legal to do so).


I agree with you. My earlier point was that a license is not a contract, and shouldn’t be confused with one. My note at the end was that there is also a contract when you acquire a license through a purchase. The contract is typically of the form “you pay us money, we give you license”. That contract too shouldn't be confused with the license acquired.

As you correctly point out, one who sells his only license to a piece of software no longer has a license. If he kept a copy of the software and continues to use it, he is committing an act of infringement. That is the same whether the license is for a term (subscription) or perpetual.


Keep in mind it's the same special status that allows the GPL to have the condition that you must release your source code if you distribute something that includes GPL code. So, "reversing" it would also reverse the GPL.


Not exactly. The GPL's special status generally comes from the fundamentals of copyright law: it attaches conditions to the duplication, modification, and distribution of a work. If not for the GPL, you'd have no right to distribute something containing the copyrighted code.

The datacenter-versus-personal conditions of NVidia drivers attach instead to the use of the copyrighted work. These restrictions are based on the idea of an end user license agreement as an enforceable contract, either agreed-upon when the driver is downloaded or through a theory that copyright attaches to the temporary (in-memory) copy of the driver necessary to run it.


Yes, maybe, but step #1 is to sue Oracle.

See if you can convince them to "let you" publish a benchmark of their database management system.

Start there.


Amusingly, Oracle is known as the slowest major DB despite their heavy handed tactics. So, actual benchmarks might actually help their sales rather than people simply assuming it’s unacceptably slow.


Did this change recently? I remember my database professor in college was adamant that when they talk about databases, I am to assume some things as a given (going by memory, am probably not completely accurate):

that the data set is large enough that it cannot fit in memory

that storage is orders of magnitude slower than memory and memory is orders of magnitude slower than processor cache

Oracle has the “best implementation” given these constraints.

Is that not the case?


It's worth noting that in current conditions the assumptions may be unwarranted.

First, while storage used to be orders of magnitude slower than memory, now SSD storage is just a single order of magnitude slower;

Second, in many domains now it's often practical to ensure that your data set can fit in memory. For example, if your system is for storing financial transactions (which is a prime market for Oracle), then your enterprise has to be quite large to get a terabyte of transactions and you can put a terabyte (or much more) of RAM in a database system if you choose to.


That's precisely the point, Oracle forbids benchmarking and comparisons in its licensing.

So how would anyone (legally) know?


Well, you cannot legally publish a benchmark, but you can set up your own for your private uses. It is not like Oracle DB detects it is being benchmarked and shuts off itself.


It's not a special status, anyone has the right to deny you a hardware product as well. I don't have to sell cars to anyone if I don't want to. If I do want to sell someone a car, I can specify a contract or license that they must follow if they buy my car. Ferrari famously only sells exclusive models to customers who have been pre-approved, i.e. they have a certain amount of income and own 5+ Ferraris already. I also cannot walk into a Lockheed Martin dealership and tell them to sell me a F-22, even if I can afford it, even if my country has permissive laws regarding the ownership of fighter aircraft.

As for software, well, EA has the right to ban me from their servers if I hack their games, even if I did pay for the product, and this makes sense because it ruins everyone else's experience. I don't pay for HN but if I did they still would have a right to ban my account if I start posting slurs or other abusive content.

Is it desirable? Of course it's desirable; imagine having no control over your own creations and having to deal with the consequences of other people abusing it.


None of those examples are equivalent. The hardware examples are ones where companies refuse to sell a product (so you never own the product to begin with), whereas the EA example is one where you’ve been kicked off online services (you still have the capability to play the game offline; you just can’t access their servers, but you don’t buy their servers when you buy the game) and the HN example is a termination of a subscription. Neither of those examples demonstrates legal limitations on software usage with a product you own (though the EA one at least comes close from a superficial perspective).


> you still have the capability to play the game offline

EA famously uses online-only DRM in many of their modern titles; if you get banned from, say, SimCity, you can't run the game at all. There is no "offline mode".


And is this desirable? How does hacking your copy of a single player game harm anyone?

Also, SimCity eventually got an offline mode.


It's not a single player game I'm pretty sure - there are leaderboards and achievements that allow you to compete with your friends. Obviously these features are moot if the top 10,000 players have a score of MAX_INT. It would have been nice to "disconnect" your city from the leaderboards if you wanted to go crazy, but unfortunately this mode was not added.

For the record I am against always on DRM so I did not buy this game nor any other game that uses it. I don't believe we need to codify laws banning the practice or any such thing that requires software developers to build things they don't want to build (with the exception of critical fields such as healthcare and aviation).

It's desirable in that a one time purchase does not entitle a customer to a lifetime of server resources; they paid for the game and they can certainly keep the game, but they don't have a right to the services required by the game (those are recurring costs). This makes sense since the alternative is forcing EA to pay to host servers for people that violated their terms of service.

You are correct that it got an offline mode eventually, I overlooked this. But this demonstrates that the market corrected this problem: Enough consumers complained to force a change. Therefore, is there need for external intervention? The simple solution to always-on DRM seems to be to just avoid buying any products that use it.


>If I do want to sell someone a car, I can specify a contract or license that they must follow if they buy my car.

Only in a limited form e.g. exhaustion doctrine prevents you from restricting resale. If someone wants to resell their exclusive Ferrari, there's nothing Ferrari can legally do (though this'll probably get you blacklisted from ever receiving an exclusive vehicle).

In general, terms can't go against existing laws and have to be 'conscionable' to be enforceable (i.e. they can't be obviously 'unfair').


The examples of software you list aren't close to equivalent. They are all services and you can get banned from a service if you misbehave. But a piece of software such as a driver is not a service.


Software updates aren't a service? Doesn't Nvidia provide updates to its drivers over time? They can choose to cut anyone off from those, including the very first initial driver download. Sure, you can own and do whatever you want with the hardware - but good luck getting it to do anything useful if you can't access Nvidia's driver download service.


Yes, software updates can be a service. Nvidia, however, doesn't provide a working version of their driver with a card. Just like buying a game is not a service, but receiving updates can be.


Sure, but a reseller could resell it under the first sale doctrine. https://en.wikipedia.org/wiki/First-sale_doctrine


> Ferrari famously only sells exclusive models to customers who have been pre-approved, i.e. they have a certain amount of income and own 5+ Ferraris already.

Sounds like discrimination to me, and not desirable.


Only a few very specific types of discrimination (religion, sex, ethnicity/race, etc) are prohibited, every other discrimination is fair game and often desired. For example, it's quite desirable to discriminate potential developer hires according to their programming ability, to discriminate potential borrowers according to their ability to repay the loan, etc.


And discrimination is only discrimination if it's illegal where PeterisP lives?


Discrimination is discrimination everywhere, as it applies to all activities where people make distinctions and treat some things, people or activities differently.

But I'm arguing that when you hear "X is discrimination", it's wrong to automatically conclude that X is bad or that X should be changed. There's a narrow subset of discrimination that's immoral and should be avoided, and a narrow subset that's illegal discrimination (there's some overlap between these two subsets but they are not exactly the same, of course), but most discrimination - and certainly the default situation - is just the reasonable human activity of applying common sense and acting according to the specific situation, instead of blindly acting the same no matter what, like robots would. It's completely normal to adapt to the specific person and act differently to suit them, making adjustments and custom approaches for different individuals; that definitely is discrimination, but there's nothing a priori wrong with it. For example, custom pricing is one form of discrimination: offering a discount for students or senior people is certainly discrimination, but we generally consider it entirely appropriate.

And in certain cases a lack of discrimination would be completely immoral. For example, the concept of "reasonable accommodations" is a requirement for discrimination. A policy that forbids electronic devices in an exam does not discriminate in any way and applies equally to everyone (in colloquial language one might call it a "discriminatory policy", but that's wrong; perhaps I'm nitpicking, but it's a misuse of words to mean their exact opposite). Yet as it forbids hearing aids for people who need them, that non-discrimination is bad; and simply allowing all devices equally would be bad for other reasons, so the ADA and equivalent laws require discriminating and applying different rules to people with different abilities.

So if you see a practice that seems definitely bad and harmful, then "is it discrimination?" is the wrong question to ask, since it may well be harmful but not discrimination, or discrimination with nothing wrong about it; these aren't edge cases, the overlap is just partial. The proper questions to ask are whether the criteria of the discrimination are fair (whether the up-thread issue of discriminating on wealth is acceptable certainly is debatable) and whether the results of that discrimination are appropriate.


But building a brand by only selling to rich people?


Sure, everything that's not explicitly prohibited is permitted, and wealth is not one of those very few things prohibited for discrimination. You're free to have a club that only admits billionaires or offer a discount that applies only to people below a certain amount of income.

The example on ability to repay is closely related to discrimination by pure wealth, but there are businesses with even more straightforward criteria, e.g. financial services that are offered only to individuals with net worth above a certain (quite large) amount, and having less money than that automatically disqualifies you from that service even if you were able and willing to pay the involved fees.


> Sure, everything that's not explicitly prohibited is permitted

That was not the issue. The question was whether it is desirable.

Personally it leaves a bad taste. It reminds me of a fashion brand that doesn't sell to obese people (can't remember the name but it was in a documentary).


So don't buy one


> Sounds like discrimination to me, and not desirable.

Are you allowing everybody who wants to have sex with you to have sex with you, or are you discriminating in favour of a select few / a unique person?

Discrimination is part of human nature.


> Discrimination is part of human nature.

Glad to see someone also came to this conclusion!


That's the point of luxury models. Not everyone can have them.


The difference between hardware and software is that copying is free for software. You can own the hardware and do whatever you want with it, because for you to reproduce it would require you to effectively be Nvidia. For software, you can't give a user ownership of exactly 1 copy of the software. If the purchaser had all of the rights of ownership, they would have the right to distribute copies for free, which obviously makes selling the same software impossible. Software is copied and hardware is moved; they're fundamentally different, so they have to be treated differently.


> Ok, so who gave software a special status over hardware?

Some american politician extended copyright protection towards software. The rest of the world eventually did the same.

> Is this desirable?

No.

> Can we reverse it?

Sure. We just need to have billions of dollars just like the copyright industry. That money can buy a lot of influence.



> Some american politician extended copyright protection towards software. The rest of the world eventually did the same.

>> Is this desirable?

> No.

So I'm sure you'd be happy if I just took the software for whatever great startup idea you'd been slaving away on for the last two years, slapped better marketing on it, and undercut you by 50% since I didn't have to employ all those pesky overpaid engineers.


If someone purchased a product from me, and then used it however they want, that's fine.

It is their product at that point, because they purchased it.

You should not have the rights to control someone else's product, such as a graphics card, or whatever, after they have purchased it. It's theirs now.


You're free to write your own graphics driver for the hardware you own, just as Nvidia is free to not help you.


Nvidia is free to not give out any graphics driver; however, that would make their graphics cards nonfunctional and hard to sell.

However, if Nvidia has sold me a functional graphics card, including the driver as an unalienable part of the package that I purchased (since the driver being functional is part of the card being 'fit for purpose' of the sale), I should be free to use the driver without any unreasonable restrictions. I have legally bought [a copy] of it, and it's not copyright infringement for me to run it on a computer - even if it resides in a datacenter.


> your own graphics driver

My whole point is that there need to be more efforts to hack and modify these things, and that this would be "more desirable".

And that orgs should be using their power to cause this to happen more. For example, if open source orgs can weaponize licensing agreements against Nvidia in order to force them to do this, then they should, and this would be desirable.


If only! The hardware is made so that only software written by nvidia can drive the graphics card.


Buy a Radeon. I'm not seeing the problem. You have options? Nobody is forcing you to buy Nvidia hardware.


> You have options? Nobody is forcing you to buy Nvidia hardware.

Laptops are generally an all or nothing proposition. I wanted a laptop with a high performance CPU and the nvidia GPU just came along with it. Couldn't even disable the thing in firmware since hardware video decoding with the Intel GPU caused kernel panics.


If you claim it matters, but not enough to impact the purchasing decision, did it actually matter?

Like you showed your disapproval of Nvidia by giving them your money anyway. So... They're right - people care enough to complain, but not buy something else, so it doesn't really matter.


> So... They're right - people care enough to complain, but not buy something else

Are you aware of the concept of market power, switching costs, barriers to entry, and market lock in?

If so, then that should enlighten you as to the explanation for this.

> did it actually matter?

Yes. It matters, yet still did not change consumer behavior, due to the concept of market power.


How can you take software that was never published in the first place? There's a reason everything is a service these days. So what's the point of these protections?


There's nothing stopping nvidia from theoretically offering a gpu that is only rented out rather than sold. It's just not really considered acceptable for hardware (at the moment).


That's only because the hardware is a useless dongle without the software.

Sure, in theory you could run an open source driver, and in practice sometimes the driver won't crash, but there's no point, because you could get an equally good open-source-driver video card for the same price, since you can't get the fancy card's peak performance from the open source driver.


Isn't that what IAAS is in essence?


rms, amongst others, long maintained that firmware is different to software.


If the hardware is effectively useless without the software driver, it could be argued the whole thing is a bit of a fraud / misrepresentation. But I guess nobody wants to sue somebody with pockets as deep as nVidia to change the status quo.


GTX/RTX cards are sold as gaming cards, so it's not misrepresentation. If it was, then every bit of hardware with locked-down software would be fraud.


“Locking down” is not the problem - the problem is that you are told you’re buying goods when you are actually buying goods which require access to rented software in order to function at all.

It’s like buying a blender and then finding out that you’re not allowed to blend anything unless someone in the manufacturers’ operation approves of it.


It's not fraud; you can return the purchase if you don't like the software license.


"The problem is in the software (the driver) which you never can buy, only license under a long list of conditions which prohibit specific uses."

Well, buying and licensing are not so different in Europe (first sale doctrine). The company cannot forbid you from reselling a license (exhaustion of intellectual property rights) in Europe.


The US respects first sale doctrine as well. See Vernor v. Autodesk, Inc. 555 F. Supp. 2d 1164. [0]

However, the ability to resell a license doesn’t remove other conditions, e.g., restrictions on data center usage.

[0] https://en.m.wikipedia.org/wiki/Vernor_v._Autodesk,_Inc.


lol. When was the last time anyone actually bought a copy of software? You own nothing. You are a party to a contract written by Nvidia and signed by you when you installed their driver. You can do only what they allow, and they can yank their permission whenever they see fit.

Not happy? That is what makes FOSS so appealing.


> When was the last time anyone actually bought a copy of software?

Satya Nadella bought Skyrim recently. All of it.


Skyrim includes licensed code such as Bink Video.


Microsoft did, even satya is perhaps not rich enough to spend 7.5 B on his own.


Have enough details about TSMC contracts ever been released/leaked to know they don't do this?

I don't follow the semiconductor industry closely enough to know anything about TSMC's business practices, but these kind of contracts are far from unheard of in other sectors.


Maybe you could just use it for the datacenter anyway. NVIDIA doesn't have a right to know how you are using it.

What are they going to do, call you and ask how you are using the GPUs? Don't answer. Message you on Facebook? Don't answer. Visit you? Don't publish your address.

Alternatively, just don't call it a datacenter. Just call it a private internet gaming cafe or something of that sort. NVIDIA doesn't have a right to know what's actually inside.


I suppose you may get raided by an organization like the BSA upon suspicion (e.g. when you get reported by a disgruntled employee).

https://en.wikipedia.org/wiki/BSA_(The_Software_Alliance)


Build your dreams in a country whose government won't give a damn about enforcing it then. I can think of several where you can safely do so, and the government will just laugh it off as a waste of time if someone tried to file a suit about something like this.

The US will fall behind in tech if it insists on enforceability of things like this.


Most companies would rather just buy the enterprise card than go through all of this hassle. It's not even a rip-off when you consider that the enterprise cards pay for the research and development on CUDA, which puts enterprise-grade tools in the hands of students and hobbyists.

AMD's version of this is simply not supporting their version of CUDA (ROCm) on consumer cards (the Navi ones, anyway).


> It's not even a rip-off when you consider that the enterprise cards pay for the research and development on CUDA, which puts enterprise-grade tools in the hands of students and hobbyists.

Monopolists can do anything with your money including sitting on their hands. Also, supporting students and hobbyists may be noble, but education is something we all pay tax for.

Also, hobbyists would be better served if they could develop their own version of cuda.


All of the BSA members are US tech companies. One could argue the enforceability of US Copyright law is to protect this industry.


I suppose the idea is that you can't use the driver...


Let's say I do this in Russia or Morocco or whatever.

How are they going to prevent me from doing so?


How are you going to sell said cloud service to companies doing business in the US or EU?


Using the internet and a payment processor, of course. The true hardware would be hidden from the client in one way or another to protect from judgements, and any inspection would be respectfully denied.


> If I buy a product, surely I can do with it as I please?

We no longer buy products these days. We license them. Another form of rent that allows the true owner to maintain control. Somehow this became the norm.


There's nearly no market for people willing to pay to own.


That's because most people don't know they don't really own the stuff they "buy" these days.


The EULA / driver license may prevent you from reverse engineering the driver to enable these features, but that is only legal protection. nVidia sells these cards saying they don't provide feature X; nVidia also sells some cards, which do provide feature X (at a different price point). There is imho nothing per se wrong with this practice. The silicon being the same in both products is an implementation detail.


If you sold pork with different price on the condition of eating it in a wood vs. a stone house, some people would consider it market segmentation or maximizing profits. Others might call it illegal price discrimination.


Who calls it illegal?


> nothing per se wrong with this practice.

There is nothing wrong with someone doing whatever they want with a product that they now own.

If someone wants to modify their own hardware, that is their right.


It's cheaper than designing, manufacturing, and stocking more chip models. It's also cheaper than designing and manufacturing 1 model and physically disabling the pieces after.

You could try to regulate that what is manufactured is not gimped on its way to the consumer for ideological reasons, but in the end you'd just end up paying more for a separate physical model, because the profit margins on these advanced use cases are simply what drives GPU design.

As for the royalty licensing, TSMC is ahead in abilities and has captured an enormous portion of the market, but it's not so far ahead that it can eat as far into customer income streams as it wants. Other manufacturers still exist and get deals; Nvidia is using Samsung 8nm for the latest round of GPUs, for example. If TSMC continues to increase its lead then we may see that type of agreement grow, though.


> Also, why doesn't TSMC slap a license on every IC that leaves their fab, taking (say) a 30% profit from every application in which their ICs are being used?

Because companies would stop using TSMC chips...?

Not to mention the logistical problems to attribute "profit" to any chip in particular.


I don't think not using TSMC is a viable option anymore. There's a (slightly worse?) Samsung alternative, but they can't satisfy the demand.


Also, if TSMC introduces the license, then Samsung may do the same.

But perhaps I'm too much thinking from the perspective of what large US based businesses would do.


US trusts are nothing compared to Asian trusts.

en.wikipedia.org/wiki/Keiretsu

en.m.wikipedia.org/wiki/Chaebol


Not about trusts, but about there being too much capitalism in the US (like the pharmaceutical industry?)


For GPUs, I agree that you currently have to use TSMC. But if TSMC were to charge 30% of profits, you would almost certainly see a migration to other fabs which would harm TSMC's long-term profitability.


> For GPUs, I agree that you currently have to use TSMC.

RTX 30 is manufactured by Samsung.


Not the Quadros used in datacenters. GA100 is on TSMC 7nm according to Nvidia themselves: https://developer.nvidia.com/blog/nvidia-ampere-architecture...


Give it 10 years and a lot of engineering time, maybe half a trillion USD and you'll get the equivalent in the mainland US. Until then, it's more convenient to use the US military resource to protect Taiwan from PRoC.


This is called a royalty and is a pretty common business arrangement when licensing e.g. a patent for your product or some stock footage for your movie.


>Why are these kind of licenses even allowed.

They're not in (some parts of) Europe.


This sounds dubious, isn't it easier/more reliable to just block parts of the hardware using fuses?


To differentiate products for the consumer and enterprise markets, Intel disables ECC RAM support for the Core i5 series and above and enables it for the Xeon E series (i3 and below are sold to both markets, so it's not disabled there).

NVIDIA reduces the number of FP64 calculation units (they're actually reduced on the die) and disables ECC RAM support for GeForce (except some Titans) so that they won't be used in datacenters. Previously this worked because most scientific calculations require FP64 and reliability matters.

But now, in the deep learning era, FP64 isn't needed and a rare RAM error doesn't matter much. So they must enforce the EULA to keep dirt-cheap GeForce cards from being used in datacenters for deep learning.


I don't think the latest i3s have ECC anymore, but they are releasing Xeon Ws which are the same chips as i7s with ECC enabled.

The price premium is like ~10%, which is fair.


I've caught up on the latest SKUs, thanks. Rebranding Xeon E to Xeon W doesn't look meaningful.


No, because they want students to have CUDA to learn with, and home devs to have it so they develop tools for it. Then, when enterprises use it for profit, they have to pay for the development of the platform.


It's basically the Adobe route.


How? That’s like saying an Intel i7 can’t be used in a datacentre.


Meanwhile people say capitalism drives innovation.

How much is nvidia single handedly holding back innovation and new discoveries?


That viewpoint is adorably naïve. The Computer History Museum in Mountain View pretty clearly falls into three categories: government projects, genuinely innovative ideas from the private sector that failed, and the people who ripped off those ideas and made a killing. There is very little overlap between the last two categories.


> The Computer History Museum in Mountain View pretty clearly falls into three categories: government projects, genuinely innovative ideas from the private sector that failed, and the people who ripped off those ideas and made a killing. There is very little overlap between the last two categories.

This is ignoring two very important things.

The first is the number of government-funded projects that burned a mountain of cash and led to nothing. Unfortunately this is the rule rather than the exception in modern times because modern government has been captured by interest groups that divert money from where it's supposed to be going to themselves, which makes everything cost ten times more than it did when the government was funding the Apollo program and ARPANET. So you can't just say "government fund more stuff" without fixing that first.

And the second is that private companies inventing stuff only to see somebody else successfully commercialize it is still causing it to be invented. And the overlap between invention and commercial success can be very little and still cause people to do it, because the reward when it happens is very large.


> Meanwhile people say capitalism drives innovation.

The saying is really that free market competition drives innovation.

Obviously patents and copyrights are government-issued monopolies, and monopolies are by definition lacking in competition.

The theory is that by granting the monopolies we get more innovation. Often the theory is wrong.

Especially when we allow the company to leverage the monopoly on the thing they actually invented into a monopoly on ancillary things that are only used in combination with that class of product.


I mean, Nvidia is just cashing in on their innovation advantage. AMD's stack was worse forever, and OSS is their white flag / hope that someone else picks up the ball and creates an ecosystem to leverage their HW.


Your second sentence doesn't contradict the first sentence. Capitalism (or more precisely, IP law) can simultaneously drive innovation and hold back innovation. The more worthwhile question is whether capitalism drives more innovation overall, but that's hard to prove either way with snarky 1 liner.


"The more worthwhile question is whether capitalism drives more innovation overall, but that's hard to prove either way with snarky 1 liner."

Sent from my iPhone.


I think you proved the parent's point.


Maybe they can't open-source it because they don't own all the IP? That's very likely the case for Windows as well, for example, Microsoft just didn't licence all the code they used for releasing the source, and now you can't go back to 1000 different IP owners and negotiate anything reasonable.


Didn't they have to manually prepare a binary patch for a security issue in the Word Equation Editor, because they either lost or could not compile the source code anymore?


The original Equation Editor was licensed from a third party (Design Science), and it is possible that Microsoft never had the source code. Maybe the third party vendor lost the source code, but I think it is more likely that getting the third party vendor to fix the bug would have required negotiation with that vendor, and maybe Microsoft and that vendor were having trouble agreeing. (This is speculation on my part, I have no inside info.)


Or equally likely, that vendor no longer exists.


Likely, perhaps, but not true, since they still exist: https://www.dessci.com/en/

Microsoft probably started wondering internally why they don't just write their own equation editor, but didn't have time, so decided to do a crazy patch to this one and then start on a rewrite.


I think that Microsoft can open-source most of the Windows sources. Nobody would care too much about a few binary blobs, and I don't believe that they don't own the license for a significant portion of the OS.


This is exactly what happened with Solaris, and it turned out to be a rather massive problem because it meant that the community couldn't actually functionally produce a derivative distribution because the original released source code didn't actually represent the entire distribution. And a project that the community can't build will always be critically undermined by that flaw.


I think that the momentum behind an open-source Windows would be immense, so the community would overcome any problems. I mean, people are making Windows distributions right now, with all sources closed, and they're doing amazing work if you ask me, with all those reverse-engineered knobs and whistles. Solaris is a niche OS after all, unlike Windows.


It was a solvable problem though, right? A bunch of different OpenSolaris distributions exist now.


I dunno if I'd call it "solved" or not... Illumos reduced the binary blobs, but to this day you have to download a bundle of them when you build it. The whole issue also added significant friction early on, which I personally think stunted the project's growth, but I'm not sure that's really knowable.

But yes, it did eventually get mitigated.


Well, at the very least, they could allow loading unsigned firmware, or allow their firmware to be redistributed, then.

This is the number one usability issue with nouveau: no firmware means no re-clocking, which means bad perf.


Here's a kernel engineer from Microsoft answering the question: What do you think about open sourcing windows and getting rid of the licensing code? [0]

[0] https://www.quora.com/What-do-you-think-about-open-sourcing-...


The last assertion made in that answer is unfounded and false - "Even if the entire OS code was made public tomorrow morning, it would take years before someone figures out how to build it, the complexity of the build system itself is mind boggling."

is contradicted by the fact that just recently a version of the Windows source (old, but still) was leaked, and people did manage to successfully build and boot the leaked Windows (XP and Server 2003 IIRC) code within days of that source becoming available.


That's what kept Solaris from being open sourced for years.

There's a talk by Bryan Cantrill about that.

They basically could not provide a fully functional OS because some marginal yet used-everywhere parts were licensed and proprietary (Bryan cites the internationalization library as an example).


Second-hand information, but apparently the reason they can't is that the driver contains code licensed from other companies, and they can't open-source that.

https://www.reddit.com/r/hardware/comments/j217oo/gamers_nex...

while obviously not an official source, that isn't particularly surprising either.

as an additional relatively-well-known-but-possibly-incorrect bit of internet lore, right now their Linux driver is basically a wrapper around their Windows driver, so that explanation makes a lot of sense. They would have to go through and disentangle what parts they own and what needs to be stripped out / replaced for the linux version at an absolute minimum.


Why is it a shame? They had been providing quality Linux drivers for years, when nobody else cared about high-end graphics for Linux. Remember fglrx?

Now AMD is open source? Great! However, it's still very far from perfect. You only have to take a look at the list of AMDGPU issues at freedesktop[1]... because being open source is easy, but working reliably in a stable manner is another matter.

[1]https://gitlab.freedesktop.org/drm/amd/-/issues


It's shame, because they hinder the progress of Linux desktop and prevent Nouveau from reclocking properly.

And what about AMD bug tracker? It's open, so you can see the bugs. That's a plus, not a minus. Nvidia blob has all the bugs hidden somewhere, so you don't see them. It doesn't mean the blob doesn't have them.


> It's shame, because they hinder the progress of Linux desktop and prevent Nouveau from reclocking properly.

I think it's the opposite. Some years ago, Nvidia was your only chance to have accelerated graphics on Linux. ATI/AMD didn't care about it at all, and Intel cards were not for gaming. So Nvidia made it possible to do things in Linux when nobody else allowed you to... how's that hindering the progress of anything? Specially when nobody forces you to get an Nvidia card.

> And what about AMD bug tracker? It's open, so you can see the bugs. That's a plus, not a minus. Nvidia blob has all the bugs hidden somewhere, so you don't see them. It doesn't mean the blob doesn't have them.

Yes, I didn't say Nvidia was bug free. I just said AMD drivers for Linux are, at the moment, far from perfect, despite being opensource. I'd say, for newer cards, they're worse than Nvidia's. I value opensource, but if I have to choose between having an opensource desktop crashing twice a day, vs. the Nvidia blob, of course I'd go for the latter, as much as I'd love to have a fully opensource OS.


I was Nvidia user for a long time due to the above, but today they aren't worth bothering with. AMD can be slower to fix bugs or have more of them on release day due to having smaller teams, but they are gradually ramping that up, and their current level of support already doesn't bother me, while they are providing a proper open source driver. Nvidia don't and have no plans to. I'd take AMD over Nvidia today any time.

Regarding slowing down progress, I was talking about the modern desktop, like Wayland compositors and so on. Nvidia was hindering it for years. And their attitude towards Nouveau is disgusting.


Well, I've been using Nvidia cards for years, and the last time I built a new computer (some months ago) I had a hard time deciding whether to stick with Nvidia or switch to AMD. Eventually, I chose to stay with Nvidia, because getting a new AMD card (apart from the fact that there seem to be no budget AMD cards...) seemed like a lottery in terms of having a stable Linux desktop environment, my best bet being an older generation card (RX 570 or RX 580) that had little availability and was overpriced here.

As for slowing down Linux desktop progress, I think it's not Nvidia's fault: you could always get a card from another vendor, although the alternatives were not as good. Well, maybe those other vendors are to blame, and not Nvidia...


I'd say it's their fault, since due to the above situation there were a lot of Linux users with Nvidia cards. Nvidia didn't care to upstream things, and that left Wayland and many other modern use cases unsupported for years.

Today it's less relevant, since Nvidia usage on Linux is gradually dropping, so the damage to progress is diminishing as well. Wayland compositors' developers can simply say "we don't support the blob and don't plan to" and be done with it. In the past it was much harder, since so many Linux users still had Nvidia while the alternatives were far less viable.


Providing support for a marginal platform is also much harder when you’re <0.1 times the size.


Are they really that afraid of someone taking their Lucky Charms?

They're afraid of patent trolls.


That's my dream... or, on a shorter time frame, at least Wayland support.


Well, in their defense, I generally haven't run into any issues with their driver, and it's also pretty easy to package/install as an end user.


I have. Like keeping a copy of the last page you viewed in Chrome and overlaying it on the screen after exiting the browser.

That one is fun.


I encountered a really fun bug where a game on Linux crashed and a ghost of the game remained on the monitor, even after it was connected to a different computer and power cycled; it remained for days. Some really interesting cascading bugs there.


That's not a software bug, that's an image retention issue with your monitor. If it was that severe then it probably got into a state where it was sending severely invalid timings to the TFT LCD array, DC biasing it, which causes long-term retention and may even cause permanent damage if done for too long.

Software isn't supposed to be able to cause that. That's on your monitor.


I _think_ it was every other frame, and I am fairly sure that it was a software/driver issue. It had never happened before and has not happened since; it started immediately when I launched the game, the symptoms got progressively worse, and the mild symptoms were triggered every time I started the game.

What I convinced myself of, after a few minutes of making sure I wasn't hallucinating, was that the graphics driver was pushing out malformed data in some way or other, which was triggering bugs in the monitor's hardware/firmware - bugs that are easy to believe are plentiful. It would be an interesting project to try to track down and replicate the bug.


That reminds me of the spookiest bug I've ever encountered: once, when resuming a Dell laptop from suspend at work, it showed a Windows desktop. Said laptop had been running Linux exclusively for several months (but it had previously been used with Windows). Interacting with the laptop made the expected xscreensaver unlock screen appear, and everything worked normally afterwards. The only explanation I could come up with was that, somehow, a snapshot of the Windows screen had survived intact in a corner of the framebuffer which the Linux driver didn't touch, even after months of power off/on and suspend/resume cycles, and a bizarre driver glitch made it visible in that particular resume cycle.


The said Windows desktop ghost screen didn't have the date and time on the start menu, did it?


If I recall correctly, the Windows XP default was to show only the time, so the date probably wasn't visible.


I get this on Macbook Pros pretty often when connecting external displays. I think the nvidia drivers are universally bad.


I'd agree, if it weren't for the fact that they give zero shits about Wayland support. I'd be totally fine with them staying closed source as long as they kept up with the standards.


What are you talking about? Wayland is supported on DEs that wanted to support nVidia chips.

Meanwhile projects like Sway have a direct "Go to hell if you use nVidia, we won't let you run this code." It's bizarre that you blame nVidia for this.


It's possible to create a Wayland compositor that works with the proprietary nVidia drivers, but it requires using nVidia-specific interfaces because nVidia refuses to support the same interfaces for non-GLX, non-X11 hardware acceleration provided by every other Linux graphics driver.

It's hardly surprising that a lot of Wayland compositor developers would rather not put in a ton of extra effort to add a special case for one particular set of proprietary drivers, which they would then need to maintain and support separately from the common code path.


To tell the whole story, the NVIDIA argument is that they want cross platform standard interfaces (EGLStream), which would use the same code the Windows driver uses, but the Linux world is pushing for Linux-only interfaces (EGL)


That may be, but the fact remains that nVidia is pushing an interface that no other Linux drivers currently support, for reasons that really only benefit them. The Linux kernel team has never been particularly supportive of middleware layers designed to promote common drivers between Linux and other operating systems, and for good reason—it impedes the development of optimized, native Linux drivers.

The only way I see nVidia succeeding here is if they clearly demonstrate that EGLStreams is a technically superior alternative to GBM, not just for their own hardware but in general, and also contribute the changes needed to support EGLStreams for all the other graphics drivers currently using GBM so that applications don't need to deal with both systems. As long as the EGLStreams code path can only be exercised in combination with the proprietary nVidia drivers it will remain a second-class citizen and projects would be well-advised to avoid it. (Drew DeVault goes into more detail[1] in his objection to the inclusion of EGLStreams support in KWin, which I agree with 100%.)

Or they could just acknowledge that this is a Linux driver, not a Windows driver, and implement the standard Linux GBM interfaces like everyone else even if that means less shared code.
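
For what it's worth, GBM is a pretty small surface from the application's point of view. A rough sketch of the allocation path (error handling omitted, device path assumed, not taken from any particular compositor):

  #include <fcntl.h>
  #include <unistd.h>
  #include <gbm.h>

  int main(void)
  {
      /* Open a DRM device and allocate a scanout-capable buffer through
         GBM, the generic buffer API that every in-tree driver exposes. */
      int fd = open("/dev/dri/card0", O_RDWR);
      struct gbm_device *gbm = gbm_create_device(fd);
      struct gbm_bo *bo = gbm_bo_create(gbm, 1920, 1080,
                                        GBM_FORMAT_XRGB8888,
                                        GBM_BO_USE_SCANOUT | GBM_BO_USE_RENDERING);

      /* A compositor would now import the buffer into EGL for rendering
         and hand it to KMS for display; the Nvidia blob instead expects
         you to go through EGLStreams for the same job. */

      gbm_bo_destroy(bo);
      gbm_device_destroy(gbm);
      close(fd);
      return 0;
  }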

[1] https://lists.sr.ht/~sircmpwn/public-inbox/%3C20190220154143...


Typo, by Linux-only interface I meant GBM.


It’s “””supported”””. It’s apparently very buggy and very difficult to debug. Sway lets you run it after you set a flag making it very clear that if something is broken you may not report a bug since the developers are unable to reasonably fix it.

Most Linux distros will also prevent you from submitting a bug report for a kernel issue if you have a tainted kernel.


Wayland doesn't work at all with Nvidia, so if you have a 4K monitor and a non-4K monitor and an Nvidia card, you're basically just fucked, because you can't selectively scale things.


I have. I needed a prerelease kernel for a new driver but nvidia had not released a binary for the new kernel yet so I was unable to use anything but the open source nvidia driver.


You could just as easily blame the author of the driver that only works on a prerelease kernel.


I still haven't been able to get my laptop's 1660 Ti to display anything other than glitch art, with either the proprietary driver or nouveau.


Is it? This is the same driver that makes me shut down X and fucks up xorg.conf every time I need to update?


Not to mention that nvidia uses proprietary configuration options even in Xorg.conf. A multi-monitor configuration which works fine in nouveau (or really any other driver) refuses to work with nvidia, because if you use the binary driver you have to set bizarre metamode options to make it work.


Most of it is dumb automatic header generation, as the article points out, but looking at the source[0], there seems to be a lot of code duplication, e.g. 5 different vcn_*.c files that seem to be of significant size and largely identical at a glance.

That frankly seems like horrible software design.

[0] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin...


How do you figure? VCN 1.0[0] and VCN 2.0[1] differ within the first few lines and diverge further the more you compare them. The files seem to correspond to each version of AMD's Video Core Next hardware[2], and have nothing to do with autogeneration.

[0] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin...

[1] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin...

[2] https://en.wikipedia.org/wiki/Video_Core_Next


Actually, there's a lot of commonality between those two files. Just look at all the "is_idle", "get_wptr", "set_wptr" type functions. They're either literally identical or the v2_0 version is a superset of functionality of the v1_0 version.

This is a case where code is identical not by random chance but due to an underlying logical pattern (in this case: ring buffers for communication between CPU and GPU seem to continue working the same way). Hiding that via code duplication is going to make the code overall less maintainable, because the code lacks implicit knowledge of these patterns.
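
To illustrate, the offset-only differences could live in a per-generation table while the shared logic is written once. A purely hypothetical sketch (made-up names and offsets, not the actual amdgpu code):

  #include <stdint.h>

  /* Hypothetical per-generation register table: each VCN revision only
     supplies its own offsets, while the ring-buffer logic exists once. */
  struct vcn_ring_regs {
      uint32_t rptr_reg;
      uint32_t wptr_reg;
  };

  static const struct vcn_ring_regs vcn_v1_regs = { 0x1000, 0x1004 };
  static const struct vcn_ring_regs vcn_v2_regs = { 0x2000, 0x2004 };

  /* Stand-in for the MMIO read the real driver would perform. */
  static uint32_t mmio_read(uint32_t reg) { (void)reg; return 0; }

  /* One get_wptr for all generations, parameterized by the table. */
  static uint32_t vcn_ring_get_wptr(const struct vcn_ring_regs *regs)
  {
      return mmio_read(regs->wptr_reg);
  }

  int main(void)
  {
      return (int)(vcn_ring_get_wptr(&vcn_v1_regs) +
                   vcn_ring_get_wptr(&vcn_v2_regs));
  }

Where the behaviour genuinely differs between generations (not just the offsets), per-generation functions still make sense, of course.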

P.S.: It's interesting how many people missed that my complaint was specifically not about the auto-generated part, but about the part that humans copy&paste.


Code duplication sometimes makes sense to better decouple different parts of the code base; however, that often only becomes apparent long after the "code cleanup" to remove duplicates has been done.


Code duplication this way (versions of code for different hw) is common in Linux though and makes a certain kind of sense.


IDK it seems pretty common in the embedded world to add support for a new piece of hardware by copying the entire driver and then changing the parts that the new hardware supports instead of trying to come up with a driver that supports all generations of the hardware. Pretty sad if you ask me and very unmaintainable, but, at least if you ask the companies, people should just buy newer generations of their hardware if they want those bugfixes :).


Wild speculation from someone who doesn't do hardware, but I'd guess that it's not so much writing the software as testing it that motivates that. It'd be hard to keep the testing matrix of apps X hardware X driver version from exploding out of control.

Your customers will be hella pissed at you if their favorite game suddenly doesn't work on their system after a driver update. So you need to test a representative sample of previously-working software across all hardware that the updated driver applies to.

That sounds like a nightmare, and a capital-intensive one because you need to keep machines up with all of these cards, both in continuous integration and for interactive debugging.

I can understand the impulse to keep the list of hardware that new driver versions support small.


This is the real reason in my experience, combined with the bringup/support cycle that comes with hardware development. If you fork the driver for bringup, you buy yourself a lot of freedom to change things. During support phase, when the hardware is already in the hands of the customer, changing things is more risky and so you have to be conservative.

For example, maybe you had a chip bug in the previous hardware generation which caused the system to hang after several days of stress testing. You found a software workaround for the bug, but every time you touch the code you need to re-verify the workaround, which takes days.

Of course, the downsides of forking the code are also very apparent...


Another example of this in the kernel is filesystems: in theory, the ext4 driver can read/write ext2/ext3 filesystems fine. And yet the kernel for years had ext2, ext3, and ext4 implementations, each later version derived from the previous. The ext3 driver was eventually removed in 2015, but ext2 and ext4 survive.


Yeah, exactly. Every new driver at AMD and NVIDIA has to fill a massive spreadsheet with most major game benchmarks from the last 5 years, along with a collection of particularly nasty games.


The thing is, DRY has some downsides. If you've pretty much nailed down the functionality of the old thing, especially if it involves some kind of manual testing/open beta test with users, the last thing you want to do is touch the old code.

I'm not defending them, but sometimes something is good enough and you have to move on.


In the context of the Linux kernel, you've never truly nailed down the functionality of the old thing. Interfaces external to the driver evolve over time (which bleeds into changes internal to the driver), requirements change, people want to add new features that old hardware ought to support -- all this adds up and puts evolutionary pressure on the drivers. Code duplication really hurts here.

It's possible that the AMD developers are from a Windows culture where they don't have to worry about this because Microsoft gives them a stable interface. Though even there, I find it questionable whether it's a win. Surely the occasional bug will surface that affects drivers for older hardware as well?


This doesn't sound like an issue with DRY as much as an issue of monolithic design.

If driver version X worked but X+1 doesn't on your hardware, don't update the driver. But when the driver is in the kernel trunk, that option isn't that simple.


> IDK it seems pretty common in the embedded world to add support for a new piece of hardware by copying the entire driver and then changing the parts that the new hardware supports instead of trying to come up with a driver that supports all generations of the hardware.

The rule of three is not exclusive to the embedded world. It's far better to have duplicate code around than to have to deal with poor generalizations that affect multiple independent modules, which in this case means hardware support.

To put it in perspective, how many pieces of hardware would you require to re-test and validate if you decided to refactor bits of drivers that might share a common code path?


This might not exist in C (I've never written it), but something like this seems like a fairly clean pattern you could use:

  abstract class BaseDriver {
    abstract commonThing()
  }

  abstract class BaseDriverV1 extends BaseDriver {
    commonThing() {
      console.log('This is the common method implementation for all V1 drivers')
    }

    abstract doThingOnlyV1Does() 
  }

  class DriverV1_1 extends BaseDriverV1 {
    doThingOnlyV1Does() {
      console.log('This is how Driver V1.1 does the V1 thing')
    }
  }
This way you can use either an interface or an "abstract" definition that declares the base driver methods and the versioned driver methods, then provide version-dependent implementations where needed, or else share the common implementations by inheritance/extension.

Maybe this turns into spaghetti at scale and it actually is easier to just copy-paste all of it, who knows.


> This might not exist in C (I've never written it), but something like this seems like a fairly clean pattern you could use:

How sure are you?

I mean, if the code path of a driver is touched, that triggers all the verification and validation steps required to ensure that the pieces of hardware affected by it will continue to work as expected.

Furthermore, how many bugs are introduced on a daily basis because people like you and me push "fairly clean patterns" that end up triggering all sorts of unexpected consequences? I know I did, and still do, my share.


C doesn't provide any language support for this, including any notion of abstracts or classes. You can still do it manually and that's basically how drivers are implemented in Linux, but it doesn't address the combinatorial test matrix explosion.
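
The manual version typically ends up as an "ops" struct of function pointers, which is roughly the pattern used throughout the kernel. A sketch with made-up names (not actual kernel code):

  #include <stdio.h>

  /* Function-pointer table: the C equivalent of the abstract-class sketch
     above. Each hardware generation fills in its own table, sharing
     entries where the behaviour is common. */
  struct driver_ops {
      void (*common_thing)(void);
      void (*v1_only_thing)(void);
  };

  static void v1_common_thing(void) { printf("common V1 behaviour\n"); }
  static void v1_1_only_thing(void) { printf("V1.1-specific behaviour\n"); }

  /* "DriverV1_1" is just a particular filling of the table. */
  static const struct driver_ops driver_v1_1_ops = {
      .common_thing  = v1_common_thing,
      .v1_only_thing = v1_1_only_thing,
  };

  int main(void)
  {
      driver_v1_1_ops.common_thing();
      driver_v1_1_ops.v1_only_thing();
      return 0;
  }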


I don't know, to me this is a bit like taking two C programs that print "hello world" and "goodbye world", looking at the compiled assembly, and then remarking "pretty sad and very unmaintainable."

Point being, maintain the code generator and not the generated code.


Right. But here the generated code is checked into the source tree and the code generator is kept proprietary, which is why it's sad and unmaintainable.


The code generator probably runs off their Verilog/HDL source for the chips. It probably relies on (or is built on top of) proprietary EDA tools.

Without open sourcing their chips, it would probably be useless.

Even if you work for a chip maker, that is what you usually get from the hardware guys, so we are probably getting the same thing as AMD's own driver programmers.

\s Welcome to the wonderful world of driver development at the majority of hardware houses \s


They could still separate the extraction of the data (keeping it proprietary) from the code generation. The latter could work off of the extracted data kept in a more sensible format.


That's how we do things at work. If we need to change a class function, we outright copy/paste it and then change what we need. This is so everything is backwards compatible. A bug in one version might be expected or documented in a previous one. We can't fix the bug in an older version unless we get the OK from our clients, because they might be depending on/expecting it for their code to run correctly.

It's a good system. You shouldn't complain. And deleting code is easy since not everything depends on everything


> That frankly seems like horrible software design.

Why? If it's auto-generated and the generated code is never touched by humans, that's not too bad.

This information comes from some CAD/synthesis tool and just describes the hardware register layout.

Manually maintaining that would be much worse and error prone.


That part of the comment was specifically talking about the parts that are not auto-generated, but copy&pasted from one hardware generation to the next -- and then bugs need to be fixed and interface changes need to be applied in N different places forever.


Compile times increase superlinearly with program size. How many cycles and developer hours are wasted if this duplication is not necessary?


Looking at things like e.g. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/lin... , this is very obviously specific to the underlying hardware's control model in all sorts of ways. If the hardware has a gazillion control registers, then the software will have to handle a gazillion registers and will therefore need several gazillion definitions in order to do it. The hardware won't get any simpler just because it takes a long time to compile the driver.
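
Concretely, those generated headers are mostly walls of defines like this (names made up here, just to show the shape they take):

  /* One offset define plus shift/mask defines for every field of every
     register, for every IP block, for every hardware generation. */
  #define mmEXAMPLE_RB_WPTR                 0x01ac
  #define EXAMPLE_RB_WPTR__VALUE__SHIFT     0x0
  #define EXAMPLE_RB_WPTR__VALUE_MASK       0x0000FFFFL

  #define mmEXAMPLE_RB_CNTL                 0x01ad
  #define EXAMPLE_RB_CNTL__ENABLE__SHIFT    0x0
  #define EXAMPLE_RB_CNTL__ENABLE_MASK      0x00000001L

Multiply that by thousands of registers and a couple of new chips per year, and the line count stops being surprising.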


> Compile times increase superlinearly with program size. How many cycles and developer hours are wasted if this duplication is not necessary?

Care to weigh the impact on developer hours and cycles of having to debug problems caused by how the autogenerated headers are created?

If the code is not expected to change and was already subjected to verification and validation, ensuring that it stays the same is a good way to avoid wasting time tracking bugs on code paths that assuredly were already verified.


"Compile times increase superlinearly with program size."

I've seen this said a couple of times lately. What is the source?


The other time that you saw it was also probably me. It's from this talk, which is about how a large amount of generated protocol buffer code at Google led to a quadratic increase in compile times: https://youtu.be/rHIkrotSwcc?t=720.

TL;DW: The reasoning is that if you use a distributed build system, then your compile time is gated by the file with the longest compile time (tail latency). The more files you have, the greater the chance that one of them takes a long time. When you generate source files, you tend to produce more files than if you didn't.

Most users don't use a distributed build system to compile the kernel, so on further thought, in that case compile times probably scale closer to linear with the number of translation units. But wasted cycles are still wasted cycles, and regardless of how exactly compile times scale, you should still consider the cost of longer compile times when you duplicate code.

With regard to link-time optimization: sophisticated analyses have superlinear complexity: https://cs.stackexchange.com/questions/22435/time-complexity....

Disclaimer: I work at Google.


Not knowing the context of the quote, I can guess a few causes:

* Compiler optimizations mostly work on a per-function basis, so that the 'n' in O(n) or O(n²) is the size of a function and not the size of the entire codebase. Not all optimization algorithms are linear, and while good superlinear optimizations should have code threshold cutoffs, in practice these may be omitted. Function sizes follow something like a power law distribution, which means larger code bases contain larger functions that are more expensive to optimize.

* For C/C++, you compile code by pasting all the header definitions into your translation unit, and feeding that to the actual lexer/parser/codegen. Larger projects are probably going to have correspondingly larger headers, and files will probably include more of them, so that the actual translation unit sizes are going to be larger even than the on-disk source files would indicate.

* This is more speculative, but larger projects are also probably likelier to blow out the various caches on your computer, so that the operations you figure are O(1) (such as hashtable lookup!) are actually more like O(lg N) in practice. And as you get larger projects that are less likely to fit any cache, the effect of the non-constant time becomes more apparent.


Internal compiler datastructures and the need to keep more things in memory?


This particular thought thread is about more translation units, not larger translation units. I can't see why those would not mostly scale linearly.


Tail latency. See my other comment (https://news.ycombinator.com/item?id=24749000).


Are you saying C compilers are O(n)?


> Are you saying C compilers are O(n)?

OP asked for sources that substantiated a technical claim. No assertion was given.

Unless you have a source that either supports or refutes the claim, trying to deflect the question does nothing to address the problem.


I very much expect O(n) compilation of stuff like array declarations, enums, CPP macro definitions and so forth. What else would it be?


They may well be, provided one stays away from bad patterns such as humongous complex functions?


Yes. Despite being an avid code minimalism and refactoring advocate, I have zero problems with auto generated code size, as long as the size of "the source that generated the code" is reasonable.

Here's how I count the size of the code:

So if a codebase is 1 million lines, but 900k was autogenerated using, let's say, a 10k line 'metaprogram' (which is otherwise not included in the source), the real size of the code is 100k + 10k = 110k.

Now, with the stupid vcn_* .c duplication, that I have serious problems with. But what you could do is take a diff of the two vcn_* .c files, keep those diffs as part of the source code, plus only one version of vcn_* .c file, but now add a small script that generates the remaining vcn_* .c files during the build process by using the diffs, e.g., using the patch command. Now the size of the diffs and the size of the patch script, plus the size of one vcn_* .c files, is a better measure of the size of the code.

Of course this is a short-term workaround. There's still a need to refactor the code properly.


That can be a useful way to calculate lines of code; however, it is only nominally correct.

In many cases it is more useful to calculate the lines of code according to what was checked into the build system, since those files can drift from the primary source. This is true, even if a million lines of checked-in source originated from 1k lines of meta-source.


I'm not saying a line-of-code calculating program like cloc should change its behavior.

I'm talking about the right way (IMO) of assessing the software complexity of the codebase.

And I'm also talking about the fact that it is trivial to make cloc match my definition of source code size "if" ... the developers are willing to do a small amount of work by moving code-gen into the build process and not committing pre-generated code into the source, but the metaprogram scripts instead.


> I'm talking about the right way (IMO) of assessing the software complexity of the codebase.

Yes, I get that, and that is what I meant, too. The reason is that complexity scales with the code that is checked into the build system, not just with the meta-code.

I've written a lot of code generators for various purposes, and despite the efforts to only get meta-code checked into the build system, what has happened in almost every case is the generated code has been checked into the vcs. Of course, this depends on the particular teams (or consumers) of the generator.

Eventually, the checked-in code gets changed. Maybe not right away, but it will most likely happen someday. That is why the complexity scales with the checked-in code and not with the original meta-code.

There are also other force-multipliers, so to speak. For example, if there is a vulnerability in the generated code that was checked-in, the actual attack surface of the company can be multiplied by the number of instances of generated code not by the meta code. Fixing one instance doesn't fix any other instance. Complexity and risk are inseparably entwined and should not be looked at separately.


Why would you check-in generated code and not the meta code?

> Eventually, the checked-in code gets changed.

Doesn't have to be. Especially with giant headers like in AMDGPU, it should be much easier to change the meta-code if there is a need to make modification in the generated code. Essentially you look at what needs to be changed, and work backwards to figure out what change in the meta-code will result in the same change in generated code.

I believe you might be referring to stuff like boilerplate code, the whole purpose of which is to be generated for further development. In which case I agree with you, but then boilerplate codes don't balloon the way AMDGPU header files did.


> Why would you check-in generated code and not the meta code?

I wouldn't. However, many teams do that. Perhaps they view it as an efficiency. In many cases people check in generated code in order to perform their own risk reduction, removing a dependency and the possibility of the generated code changing outside of their control.

> I believe you might be referring to stuff like boilerplate code, the whole purpose of which is to be generated for further development. In which case I agree with you, but then boilerplate codes don't balloon the way AMDGPU header files did.

No. I'm definitely not referring to boilerplate code.


I find it funny that developers (especially less experienced ones?) are so fascinated with the "big size" of the Linux kernel. I've seen this in various discussions, both online and offline.

By commercial standards, the Linux kernel is really small. The average enterprise application which has been in development for that long (almost 30 years) usually has at least several million lines of code, if not tens of millions or more.

User facing functionality is HUGE in size compared to tech functionality, usually.


The size is impressive relative to other kernels. A line of kernel code is much more expensive to develop than a line of application code. Of course, size is not the best metric by which to judge a project; it's always better to have more capability with less code if you can manage it. For its size, Linux supports quite a wide variety of architectures and add-on hardware. Also, a great deal of effort goes into keeping the kernel codebase maintainable, which is more likely to manifest as lines of code removed rather than added.


Linux is the only kernel in the world with wide-ranging hardware support (due to its development policy). Everything else is either orders of magnitude more niche and only supports a tiny subset of hardware, or has a stable ABI and relies on out of tree drivers almost exclusively (Windows). Nevermind architecture support.

No given person is ever actually running most kernel code. If you took only lines compiled into the core on an average system plus the line count of currently loaded modules, you'd come up with a much smaller number.


A Google dev once told me that every day they write as much code as the whole kernel contains.


Quick math tells us that every employee of Google (and that includes non-technical people) would have to write around 200 lines of code per day, which seems completely implausible in a company with a heavy software development process.


The order of magnitude is off but the scale is still much, much higher. I think they have billions of lines of code.


maybe if you also count tests, that's not very unreasonable.


And it still has a bug with residual cursor after wake from sleep on my desktop running Ubuntu 20.04...

https://gitlab.gnome.org/GNOME/mutter/-/issues/1108

Hopefully not for long though with this

https://cgit.freedesktop.org/~agd5f/linux/commit/?h=amd-stag...


Oh, let's not forget (to pile on) the GPU hangs on the 5000 series cards across many games and, for some people, many ordinary applications.

https://gitlab.freedesktop.org/drm/amd/-/issues/914


What I find worse is the lack of analog output support with DC. One of my monitors only has a VGA input and I have no intentions of replacing it.

However, the new Display Core does not support analog output and it will stay dark.

Fortunately amdgpu.dc=0 is still an option, but I dread the day when this code path is removed (or bit rots away).


Is it a really good VGA monitor or something?

You can buy relatively inexpensive active DVI/HDMI to VGA converters if that helps.


> Is it a really good VGA monitor or something?

Not at all. It's an old 19" with a resolution of 1280x1024, you can get those pretty much for free these days.

But hey, n + 1 screens > n screens :D

I usually just have my browser on it in full screen. And because of the resolution I don't have to worry about overly long lines on websites that aren't restricted in width (like Hacker News).

At work I picked up an old 19" TFT from the trash that connects neatly to the unused VGA port of the docking station. It has abysmal ghosting issues, so it's no use for the browser, but still good enough for a terminal.

With modern graphics cards supporting many outputs and monitors that you can literally get for free, you can have the luxury of dedicating a display to a single task or application, so that's pretty cool.


Buying an LCD monitor with no digital input was always a bad way to cut costs, but you can buy active DVI-to-VGA or DisplayPort-to-VGA adapters to keep these devices working with new hardware.

Moving rarely-used DAC hardware to an external dongle seems like a good design decision, though it makes more sense for driving a CRT, where the conversion is D->A rather than D->A->D->A. A CRT can be better than a modern LCD in some ways, whereas VGA-input LCDs are simply obsolete.


I have an early Dell IPS from circa 2005 which is 1280x1024 19" and it supports DVI input, fortunately.


So does my latest macOS with Intel video and a second monitor.


Yeah, the intel driver in Mojave on my 2015 13" pro has a bunch of issues.

I see random buffer garble when waking the external screen, sometimes the machine freezes with hw video decoding and the resolution switches to low-def on monitor sleep only to switch back on monitor wake. It's a shitshow.


AMDGPU alone was significantly larger than the entire OpenBSD kernel sources. The dev/pci/drm directory ballooned to around ~130M. As of 6.8/-current, it's now 191M. It was under ~20M before May of 2019, prior to it first being committed.

This driver is, I believe, part of the common codebase that AMD upstreamed to the Linux kernel; it's shared with its proprietary driver on other platforms.


What percentage of the Linux kernel is actually still there after you run it through the preprocessor on a modern system?


A rough rule of thumb is 20 bytes of object for 1 line of C.[1] So if you have a 5MB kernel you might have 250k source lines of code.

That said, the Linux kernel is not typical code. Great swaths of it are architectures and devices you will not need, so they are rightly ignored.

[1] https://www.leshatton.org/Documents/LOC2005.pdf


I think that’s the core of the question. How much of the kernel is left if you strip out all of the sections that would be stripped by the preprocessor when building for a current-generation 64-bit x86 target.

AWS had a blog post about this topic a few years back. They created a cut-down Linux variant for running Lambda functions that only supported exactly their server architecture, only contained exactly the drivers they actually use, and notably only had 1 keyboard driver for a keyboard with only 1 key, which was mapped to Halt.



It is another highly complex computer after all.


Honest question: why do these GPU drivers need so many lines of code? Can anybody with more background explain?


GPUs are complex computers in themselves, and the driver needs to provide compilers etc that target them (e.g. when it comes to shaders, applications hand the driver source code that is then compiled for the specific GPU architecture). Combine that with many different product generations and configurations being supported, code apparently being autogenerated and thus probably not optimized for size and you get to this.

EDIT: monocasa points out below that the compiler is not a correct example - true, that does live in e.g. mesa userspace code.


GPU drivers are essentially operating systems for the GPU.


And also ~real time compilers for any user code (shaders, etc) that will be running on the GPU.


The compiler isn't in the kernel though. That's in the user space half.


People complain about Nvidia's proprietary driver, but imagine if they opensourced all of their software stack and had it in the kernel. If AMD is 10.5%, Nvidia might be like 35% :)


lol, not really. 10.5% is ~190M. If Nvidia were 2x that, it'd be 380M of code, which would actually mean a smaller percentage, as the denominator expands significantly too, so their share falls as well.


According to the stats in the article, the kernel contains 225085 lines of assembly.

Just for comparison, the whole DOS 2.0 operating system, which was written in assembly entirely, has just 37955.


> Just for comparison, the whole DOS 2.0 operating system, which was written in assembly entirely, has just 37955.

Apples to oranges.

I mean, how many drivers were shipped with DOS 2.0?


I think you're reading the comment backwards from its intent. To me this comment is just pointing out that, while the vast majority of the Linux kernel is C, it supports so much stuff that it still has an order of magnitude more assembly than old operating systems that were 100% written in assembly.

Yes, apples to oranges... but interestingly so not as a knock that DOS was coded better or something.


Right, and how many distinct instruction set architectures did DOS 2.0 support? The fact that there are only 225k lines of assembly in the Linux kernel, across the 30 or so major architecture families Linux supports plus any number of variations within each family, is rather impressive. If you ported DOS 2.0 to one variant of each of those 30 architectures, assuming roughly equal code density, that would be over a million lines of assembly code altogether, for a far, far less capable system. That's considering only the OS core, not device drivers, which is reasonable since DOS 2.0 didn't really have any drivers worth speaking of by modern standards. Most of the real low-level I/O was handled by the BIOS.


> The fact that there are only 225k lines of assembly in the Linux kernel, across the 30 or so major architecture families Linux supports plus any number of variations within each family, is rather impressive.

Except that's not the intent that's conveyed by blindly throwing LoC numbers into the air. The intent is to suggest the exact opposite: bloat. Hence the blind comparisons that mean nothing at all.


But can't it run Crysis on Ultra at 120FPS?


Is the AMD Radeon Graphics Driver part of the Linux Kernel? Does that make sense? Aren't graphics drivers updated independently from the kernel? I'm aware that Linus isn't a fan of microkernels, but this seems a bit overly monolithic to me.


Can someone with more understanding of the Linux kernel arch/design describe why these drivers are in the kernel tree? Are they not part of a HAL, or are they more "integrated" than that?


Linux does not use a stable API to talk to drivers, so drivers are intended to be part of the kernel tree so they can change as the rest of the kernel evolves.


Here's some more HN discussion around this: https://news.ycombinator.com/item?id=14533398

And the relevant doc in the kernel source: https://github.com/torvalds/linux/blob/master/Documentation/...


Eh, that's not exactly true. Most of the API is pretty standard and stable. They don't have a fixed ABI, which is how closed source OSes handle driver compatibility.


There is a significant difference between having a supported defined interface boundary, and a semi-stable interface that doesn't necessarily guarantee backwards compatibility with prior interfaces.


> There is a significant difference between having a supported defined interface boundary

Which is literally the delineating mark between an API and an ABI.

A stable ABI means your binary interface will remain the same and your compiled drivers should work, despite any changes on the kernel level. A stable API means you'll probably have to recompile your drivers when the kernel changes, but that they'll require little (if any) modifications.
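
A toy example of the distinction (made-up struct, nothing to do with real kernel headers): a change can leave the API intact while still breaking the ABI.

  #include <stdio.h>
  #include <stddef.h>

  /* Hypothetical "kernel header", version 1. */
  struct dev_info {
      int id;
      int flags;
  };

  /* Hypothetical version 2: same field names and same function signatures
     (the API), but the inserted field moves 'flags' to a new offset, so
     the binary layout (the ABI) changed. */
  struct dev_info_v2 {
      int id;
      int irq;    /* new field */
      int flags;  /* now at a different offset */
  };

  int main(void)
  {
      /* A driver compiled against v1 bakes the old offset into its binary;
         run against a v2 kernel it would read 'irq' where it expects
         'flags'. Recompiling against the v2 header fixes it with zero
         source changes: stable API, broken ABI. */
      printf("v1 flags offset: %zu\n", offsetof(struct dev_info, flags));
      printf("v2 flags offset: %zu\n", offsetof(struct dev_info_v2, flags));
      return 0;
  }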


A stable API means you will definitely not have to make any modifications, and Linux provides no such guarantees. "Pretty" standard and stable is not the same at all.


So, by that logic, there has never been a stable driver API in the history of OSes. Windows 9x, Windows 2000/XP, Windows Vista/7/10 all had different graphic driver models and were incompatible. Same with various releases of OS X/macOS.

Of course I’m not implying that the driver interfaces have a guarantee or will not change, especially with the rolling release model of Linux. However, you can be pretty certain within a major point release that there won’t be significant breaking changes. Thus the “pretty stable”. It’s not a pro nor a con, it’s just the style that works for that open source project.


It's a matter of degree I guess. My understanding is that the kernel pretty much reserves the right to change these APIs as they feel fit, without much regard for the consequences for out-of-tree drivers, even if in practice the interface has settled and isn't in need of changing much currently.



IMHO a fairer comparison would be the amd drivers vs the nvidia drivers, but we don't have the nvidia sources (of course).


83k lines of YAML. Wow.


What percentage of loaded and running binary is it?


0% on mine (cause I'm on Intel and Nvidia HW) and have a custom kernel config.


While the down votes are understandable, this is actually a somewhat legitimate take; the kernel's pretty good about not loading stuff that it's not actually going to use on your actual hardware, so the impact of this is limited to people who are actually using that hardware, which by extension means they kinda do need the driver in some form.


This is what I was interested in. As I understand it, there are extra drivers in most installs and this is reconciled as the kernel boots.


Correct. A bare kernel to get to an initrd that loads the rest of the modules is the common distro pattern.

I build a kernel without an initrd, with only my necessary hardware and then only the modules I need.

It's pretty easy, boots fast, stable, etc.

But, I'm already a type-A for my work-box (Gentoo)


I mean, you can just run lsmod to see what modules are loaded; I don't think that lists anything that's statically compiled in, but most distros don't really do that AFAIK.


I really like that the AMD drivers are open source and upstreamed into the kernel. However, that doesn't solve all the use cases. If I want to use newer hardware on an older distribution (RHEL, Debian, Ubuntu LTS, etc.), the upstreamed drivers will be too old. Distributing the driver like Nvidia or Radeon Software for Linux does will still be necessary.


You can always install a newer kernel - thankfully Linux is notorious for never breaking the userspace as a general rule.


It is a reasonable thing to want to keep the kernel stable for custom kernel drivers. So stable kernel space is also important to me.


The "custom kernel drivers" are the problem. Whatever they are, they should be upstreamed too. Trying to keep a stable ABI just causes problems.


That might be an ok solution for consumer devices, but the kernel drivers I'm talking about are for custom setups that will only be used internally by me. There would never be any reason to upstream them. Constantly upgrading the kernel just creates a lot of work.


Either way, what you're looking for is a backport; open or proprietary doesn't change that. I think it's important to note that amdgpu-pro already requires amdgpu as well, so "will still be necessary" has already come to pass.


Isn't the implication that, to some extent, they are chasing their own tail: adding features on a wobbly foundation gives wobbly features.

Never looked at a single line of their code, but still presumptuous enough to suggest pruning and repotting before watering the beautiful rain of more features :)

-- An idea to be fertilized with several grains of salt


A kernel is supposed to be a small set of necessary code to boot and manage some hardware. It's host to drivers, applications and UIs. Thus the name, 'kernel'.

To compare a driver with the kernel proper seems very strange. There are other large codebases in the world too.


The title is misleading; there is a nontrivial distinction between the kernel and its source code.


Does something "being part of the kernel" mean it's part of every Linux distribution? Even on server distros?


"Being part of the kernel" is defined as being in the kernel's source code tree. Not all of the kernel's source code is included in a kernel binary, most pieces are compiled as modules that can be loaded at run-time. However many distribution do indeed ship the binaries for most kernel modules so some server distros like Ubuntu server probably do have GPU drivers installed, although they are never loaded.


If the Linux kernel can be sliced and diced, what's the point in putting non-essential code, that won't be used by the majority of running systems, in the kernel?

Why not offer the driver as an additional package / download?


Shouldn't a GPU driver have a significant representation in the Linux Kernel? It represents a lot of the computation.


In source code, or in compiled bytes?


According to the first line of the second paragraph, it is source code


It turns out that it's generated source code. So the actual source code may be substantially smaller.


Are they also open-sourcing the tool that generates the committed code? If not, then there's little difference between "generated" and "actual" source code.


Or just the huge output from cloc that is showing the count of lines of code.


109 lines of a "Windows Module Definition"? What's that about?


I found [this](https://old.reddit.com/r/linux/comments/2e255r/this_is_how_b...)

> False positives because def and config are pretty common suffixes:


What is the one sed file?


And it still doesn’t support 8K MST monitors.


And they still don't work.


Downvote all you want - AMD's open source drivers still do not work. Even the proprietary drivers do not work.

The driver has had issues for ages with AMD's new cards. Guaranteed, if you're downvoting this you have not used a recent AMD card on Linux.


As someone developing a cross-platform game engine, the AMD Linux drivers perform quite well, notably better than many proprietary Windows drivers do, both from a performance and stability perspective.


What doesn't work? I have a 5700XT and it seems to be fine to me. What am I missing?


While compiling the kernel,

    yes '' | make localmodconfig 
will create a config based on the current config and the loaded modules (lsmod). It disables any module option that is not needed for the loaded modules.


Hacker News: Nvidia is evil because they don't have an open source driver !

Also Hacker News: AMD is evil because they increase the size of the kernel by 10%.

Honestly, I think Nvidia made the right call by just focusing on ordinary users who want the card to work so they can focus on their own work.



