• Hey, guest user. Hope you're enjoying NeoGAF! Have you considered registering for an account? Come join us and add your take to the daily discourse.

TLOU2 Pro Performance Mode might give us the GPU cost of PSSR rendering.

SlimySnake

Flashless at the Golden Globes
Just saw this gaming tech video. Pros performance mode runs at 1440p internal resolution just like base PS5s performance mode. PSSR then reconstructs the image to 4k. Looks way better than 1440p.

However, the framerate comparison could help us determine just how much the PSSR reconstruction could cost on the GPU. The framerates are roughly in line on both the Pro and PS5 suggesting most of the circa 45% extra horsepower is going into PSSR reconstruction. In other Pro games, we have seen an extra 30% raw gpu performance so it could also be just 30%. In some cases, it dips around 10 fps below the ps5 performance mode like screenshotted below but that could also be a vram bottleneck since the textures and other effects are still rendered at 4k.

Timestamped:



iUPuqi3.jpeg


Regardless, the reconstruction costs are in line with what we've seen from DLSS and FSR.
 

Gaiff

SBI’s Resident Gaslighter
He also compares TAA to PSSR. However, I'm not sure if TAA and PSSR are running the same input and output resolutions.

3xheS7Q.png


 

SlimySnake

Flashless at the Golden Globes
Assuming 35% faster gpu and no cpu bottleneck, ps5 pro without pssr should be getting 111 * 1.35 = 150 or 6.66ms frametime, with pssr its getting 102 or 9.8ms, so pssr takes 3.14ms, that is actually way slower than what I expected, a 3080 takes ~1ms to upscale from 1080p to 4k.
Nah. ive done several tests on this in several different games and it varies from game to game. some games cost 20% while others can go up as high as 35%.
 

winjer

Gold Member
I bet the performance cost of PSSR is around 2-3 ms, just like most temporal upscalers.

The number that Cerny said about 45% more performance is probably a best case scenario.
Most often than not, the Pro will be limited by memory bandwidth.
 

SweetTooth

Gold Member
I bet the performance cost of PSSR is around 2-3 ms, just like most temporal upscalers.

The number that Cerny said about 45% more performance is probably a best case scenario.
Most often than not, the Pro will be limited by memory bandwidth.

Winjer chimed in with his wisdom, those deceiving cucks at Sony knows nothing about hardware design!!

How come you haven't been hired? Cmon Microsoft! Snag this genius

Edit: actually there are multiple geniuses in this thread! What are you doing Microsoft?!
 
Last edited:
Assuming 35% faster gpu and no cpu bottleneck, ps5 pro without pssr should be getting 111 * 1.35 = 150 or 6.66ms frametime, with pssr its getting 102 or 9.8ms, so pssr takes 3.14ms, that is actually way slower than what I expected, a 3080 takes ~1ms to upscale from 1080p to 4k.
Is your 1ms upscaling DLSS? I wonder if the added time could be PSSR not being quite as optimized.
 

Haint

Member
This was pretty apparent when it was first revealed pssr games were largely running at the same res as base PS5, and even dynamic res wasn't really pushing render higher. A very very expensive solution, both in terms of processing and dollar cost ($800 console).
 
Last edited:

Three

Member
I don't know what you mean by % cost.
He means the the 35% differs from game to game so doing a theoretical 1.35 to figure out the delta would not be true and is an assumption. It could be 20% for this game making the delta smaller in your calculation.
 

OverHeat

« generous god »
Winjer chimed in with his wisdom, those deceiving cucks at Sony knows nothing about hardware design!!

How come you haven't been hired? Cmon Microsoft! Snag this genius

Edit: actually there are multiple geniuses in this thread! What are you doing Microsoft?!
WTF.gif seriously what is your problem? Your gonna get mass reported with this attitude.
 
Last edited:

Embearded

Member
It has nothing to do with what silicon. It's added processing time before displaying the image thus it has a time in ms which affects framerate.

Yes, but GPU need wait to the upscaled frame. Unless they make their game full asynchronous, but I really doubt most developers would do

Thanks, I understand that, it takes time in the pipeline of the frame. Some of the wording here confused me and i thought they meant it's an algorithm solution like FSR.
 

Three

Member
Yes, but GPU need wait to the upscaled frame. Unless they make their game full asynchronous, but I really doubt most developers would do
How would you even do this with AI upscaling. Is it possible somehow?
 
Last edited:

Seomel

Member
DF spoke about thid in new direct weekly and they have much better game to judge this by - GOW Ragnarok patch whoch dropped. You can select taa or pssr with unlocked fps, so they were able to get idea of cost of pssr
tldr- around 2ms latency cost, pssr resolves much better image. There isnfps difference but a lot of paces you are like 70-100range and with vrr it doesnt really have big difference in feel
Screenshot-20241110-183355-You-Tube.png
 
Last edited:

zeroluck

Member
How would you even do this with AI upscaling. Is it possible somehow?
Yes, ID tech did it with Wolfenstein Youngblood using async DLSS, DLSS only uses tensor cores, leaving shaders cores idle, if you have other tasks that don't use too much memory bandwidth you can run them on top of DLSS. If PSSR uses something akin to tensor cores then it should also be possible, if not then no.
 
Last edited:

Three

Member
Yes, ID tech did it with Wolfenstein Youngblood using async DLSS, DLSS only uses tensor cores, leaving shaders cores idle, if you have other tasks that don't use too much memory bandwidth you can run them on top of DLSS. If PSSR uses something akin to tensor cores then it should also be possible, if not then no.
I'm not familiar with ID techs method. Doesn't the frame need to be ready before the upscale, how do you do it async?
 

zeroluck

Member
I'm not familiar with ID techs method. Doesn't the frame need to be ready before the upscale, how do you do it async?
ID tech does presentation of frames in compute queue asynchronously, new frames don't need previous frame to finish before rendering. They overlapped upscaling with the rendering of the starting portion of next frame.
 
Last edited:

SpokkX

Member
Yeah, DLSS

YdcuGZ7.png
Nvidia is truly amazing. I use DLSS in every supported game and it always looks better but more importantly runs WAY better. Looking at that chart DLSS cost is almost not worth mentioning anymore - it is almost free. Imagine on 50-series
 

SpokkX

Member
DF spoke about thid in new direct weekly and they have much better game to judge this by - GOW Ragnarok patch whoch dropped. You can select taa or pssr with unlocked fps, so they were able to get idea of cost of pssr
tldr- around 2ms latency cost, pssr resolves much better image. There isnfps difference but a lot of paces you are like 70-100range and with vrr it doesnt really have big difference in feel
Screenshot-20241110-183355-You-Tube.png
So what you get lower fps with PSSR? Is it scaling from the same res as without?
 

TrebleShot

Member
The Performance Pro mode on TLOU is the original Fidelity mode at a lower res then upscaled, so its not the original 1440p mode then upscaled.
It also has better shadows, lighting, models etc.
 

Lysandros

Member
What happens when other optimized native applications even exceeds the stated 45% performance differential, words would be retracted? There are some which are afflicted by the addiction to assume the worst when it comes to Playstation hardware. Don't be consumed by the eagerness to jump into conclusions. Give the system some damned time to prove its capabilities.
 
Last edited:
It's funny how some people are in total denial about the fact that PS5 Pro is not as ‘Pro’ as expected, and is just a PS5 with a bit more raw power (which goes away when using PSSR) and AI rescaling for only $699 (disc drive and stand not included).

Curious where you pulled that knowledge from?? Someplace that stinks I think 🤔
 

kevboard

Member
this is still AMD hardware for the most part, so this wouldn't be a bit a surprise if it turns out to be accurate.

also interesting to note is that apparently in the code of the UE5 plugin for PSSR, PSSR uses nearly the exact same names and terminology for its variables and settings as those that previously were only used by XeSS.

which makes me wonder if PSSR is a modified version of XeSS that simply runs on the shaders. the base PS5 wouldn't be able to support XeSS at decent performance levels as it doesn't have hardware acceleration for DP4a, so an upgrade to a newer version of RDNA was necessary to make it work.
and the additional Compute Units were needed to mitigate the performance impact the DP4a path of XeSS still has when using big upscaling factors like a 4x scale from 1080p to 4k (since the more pixels that need to be created the higher the cost)
 
Last edited:

Lysandros

Member
I just heard from a good friend who works at Sony PlayStation hardware engineering department that (rather bafflingly) employees there use two completely unique terminologies to refer to GPU shaders. One of them is "custom hardware for machine learning." And the other "An actual custom silicon AI upscaler." Well, that would explain everything.
 
Last edited:

PicoLordens

Member
One thing I noticed while playing earlier is that on Pro mode the LOD draw distance is way worse than on fidelity mode. Very easily observable.
 

Mr Moose

Member
this is still AMD hardware for the most part, so this wouldn't be a bit a surprise if it turns out to be accurate.

also interesting to note is that apparently in the code of the UE5 plugin for PSSR, PSSR uses nearly the exact same names and terminology for its variables and settings as those that previously were only used by XeSS.

which makes me wonder if PSSR is a modified version of XeSS that simply runs on the shaders. the base PS5 wouldn't be able to support XeSS at decent performance levels as it doesn't have hardware acceleration for DP4a, so an upgrade to a newer version of RDNA was necessary to make it work.
and the additional Compute Units were needed to mitigate the performance impact the DP4a path of XeSS still has when using big upscaling factors like a 4x scale from 1080p to 4k (since the more pixels that need to be created the higher the cost)
So because it uses similar naming in UE5, it must be XeSS?
 

kevboard

Member
So because it uses similar naming in UE5, it must be XeSS?

it is a bit strange that all other upsampling methods have very distinct naming conventions. this is true for XeSS, DLSS, FSR, and TSR. PSSR is the only one that is this similar to another one, which happens to be XeSS.

so it is curious. and it wouldn't be that far fetched, as Intel already plans to eventually make it entirely open source.
so it wouldn't be surprising if Sony worked with Intel to fork off of it. especially since XeSS is very close to DLSS in quality in many games
 
Last edited:

Mr Moose

Member
it is a bit strange that all other upsampling methods have very distinct naming conventions. this is true for XeSS, DLSS, FSR, and TSR. PSSR is the only one that is this similar to another one, which happens to be XeSS.

so it is curious. and it wouldn't be that far fetched, as Intel already plans to eventually make it entirely open source.
so it wouldn't be surprising if Sony worked with Intel to fork off of it. especially since XeSS is very close to DLSS in quality in many games
Oh yeah, I forgot they were using an Intel GPU.
 

Chuck Berry

Gold Member
Just saw this gaming tech video. Pros performance mode runs at 1440p internal resolution just like base PS5s performance mode. PSSR then reconstructs the image to 4k. Looks way better than 1440p.

However, the framerate comparison could help us determine just how much the PSSR reconstruction could cost on the GPU. The framerates are roughly in line on both the Pro and PS5 suggesting most of the circa 45% extra horsepower is going into PSSR reconstruction. In other Pro games, we have seen an extra 30% raw gpu performance so it could also be just 30%. In some cases, it dips around 10 fps below the ps5 performance mode like screenshotted below but that could also be a vram bottleneck since the textures and other effects are still rendered at 4k.

Timestamped:



iUPuqi3.jpeg


Regardless, the reconstruction costs are in line with what we've seen from DLSS and FSR.


What’s your point Vanessa?
 
Top Bottom