🎉 Celebrating 25 Years of GameDev.net! 🎉

Not many can claim 25 years on the Internet! Join us in celebrating this milestone. Learn more about our history, and thank you for being a part of our community!


Double to float C++

Joe · 2024-05-23T06:07:37

taby said:
Yes, Mach's principle is what you're looking for.

Looking that up, I see they still try to confirm it with experiments. Little do we know…

taby said:
Edit: I'm giving up.

Well, maybe god will tell us more about the universe after we die. At least the few things he knows about it. ; )

taby said:
I'm sure that you'll figure it out, after more thought.

Sadly the solution is to be more conservative about the upper bound on how far a local change can spread through my geometry.

If I move an object, the initial geometry only changes locally. But after that I calculate the smooth crossfield, wavefield, remesh, and hierarchy, and materials are still in front of me. With each processing pass I do, the change can affect a larger distance, because each pass considers hierarchical adjacency.

The result is that I need to update a large area even for small changes. Maybe it does not matter that my progress is so slow. To make this practical for production, very powerful HW is needed to keep waiting times acceptable. I depend on the progress of the HW guys I criticize so much. ; )

General and Gameplay Programming

Started by taby May 13, 2024 04:27 PM

181 comments, last by JoeJ 2 weeks, 2 days ago

JoeJ

4,261

May 20, 2024 08:51 PM

taby said:
Any thoughts on how to achieve snapping-to without casting?

double plank = 0.0000000000000000000000000001;
double quantizedValue = round(value / plank) * plank;

But no. Nature does not do this. It can't be. : )
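For reference, here is a minimal sketch of the snap-to-grid idea from the two lines above, with the step passed in as a parameter. The helper name `snap_to_grid` is hypothetical. One caveat worth checking before relying on it: every double with a magnitude of 2^52 or more is already an integer, so once `value / step` gets that large, `round` has nothing left to round. With a step of 1e-28 that holds for any value above roughly 4.5e-13, and the function then returns its input, give or take the rounding error of the divide and multiply.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper: snap `value` to the nearest multiple of `step`.
double snap_to_grid(double value, double step)
{
	assert(step > 0.0);

	// Count how many whole steps fit, then scale back up.
	return std::round(value / step) * step;
}
```

With a coarse step the snapping is visible: `snap_to_grid(0.3, 0.25)` gives 0.25 and `snap_to_grid(0.4, 0.25)` gives 0.5.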

taby

1,509

Author

May 20, 2024 11:01 PM

I tried your code, as well as some similar code. Neither reproduces the results.

https://stackoverflow.com/a/70221868/3634553?stw=2

Hmm…

Thank you joej!!!!

taby

1,509

Author

May 20, 2024 11:56 PM

This works, but it's still implicitly casting:

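#include <cmath>
#include <limits>
using namespace std;

double truncate_normalized_double(const double d)
{
	// Both assignments narrow double to float implicitly.
	const float a = d - numeric_limits<float>::epsilon();
	const float b = d + numeric_limits<float>::epsilon();

	// Measure the distances in double so they are not rounded a second time.
	const double r1 = fabs(d - a);
	const double r2 = fabs(d - b);

	// Return whichever candidate lies closer to d.
	return static_cast<double>(r1 < r2 ? a : b);
}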

taby

1,509

Author

May 21, 2024 12:56 AM

This also works. I do the cast from double to float, which is not absolutely necessary – I could convert them using stringstreams manually, which is just slow.

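#include <cmath>
using namespace std;

double truncate_normalized_double(const double d)
{
	// Clamp to the normalized range [0, 1].
	if (d <= 0.0)
		return 0.0;
	else if (d >= 1.0)
		return 1.0;

	float df = d; // implicit double-to-float conversion

	// Step down from 1.0f one representable float at a time until reaching df.
	float tempf = nexttowardf(1.0f, df);

	while (tempf > df)
		tempf = nexttowardf(tempf, df);

	return static_cast<double>(tempf);
}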
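If the goal is float precision with no double-to-float conversion at all, one option is to round the bit pattern directly. The following is only a sketch under stated assumptions: doubles are IEEE 754 binary64, the input is finite, and its magnitude lies in float's normal range. The name `round_to_float_precision` is hypothetical and does not come from any library.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Hypothetical helper: round a double to a 24-bit significand (float precision)
// using integer operations only. Float subnormals, overflow, and NaN are not handled.
double round_to_float_precision(double d)
{
	assert(std::isfinite(d));

	std::uint64_t bits;
	std::memcpy(&bits, &d, sizeof bits);

	const int drop = 52 - 23; // mantissa bits a float does not have
	const std::uint64_t low_mask = (std::uint64_t{1} << drop) - 1;
	const std::uint64_t half = std::uint64_t{1} << (drop - 1);
	const std::uint64_t lsb = (bits >> drop) & 1; // lowest bit that is kept

	// Round to nearest, ties to even. A carry out of the mantissa
	// increments the exponent, which is the desired result.
	bits += half - 1 + lsb;
	bits &= ~low_mask;

	double result;
	std::memcpy(&result, &bits, sizeof result);
	return result;
}
```

For inputs in that range this should agree with `static_cast<double>(static_cast<float>(d))` under the default rounding mode. Values below float's smallest normal number (about 1.18e-38) will not agree, because the sketch does not model float subnormals.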

taby

1,509

Author

May 21, 2024 01:16 AM

As I'm sure you can imagine, this problem vexes me to no end. LOL

Thanks to everyone who had input here!

taby

1,509

Author

May 21, 2024 01:25 AM

I need a number library that lets you specify the bits. ttmath uses words, like 64-bit words on x64 architecture. I need to specify the exact number of bits.

JoeJ

4,261

May 21, 2024 06:40 AM

taby said:
I need a number library that lets you specify the bits.

Do it yourself:

float value = float(PI);
for (int shift = 0; shift < 23; shift++)
{
	uint32_t bits;
	memcpy(&bits, &value, sizeof bits); // read the float's bit pattern
	uint32_t mantissa = bits & 0x007FFFFF;
	uint32_t signExp = bits & ~0x007FFFFF;
	mantissa &= 0xFFFFFFFFu << shift; // zero the lowest `shift` bits
	bits = (mantissa | signExp);
	float reduced;
	memcpy(&reduced, &bits, sizeof reduced); // turn the bits back into a float
	ImGui::Text("PI reduced by %i bits: %f mantissa: %x", shift, reduced, mantissa);
}

The idea is to extract the mantissa and zero out the n rightmost bits.

You could do this for any double after reading up on how many bits it uses for the mantissa, and using uint64 of course.

JoeJ

4,261

May 21, 2024 06:44 AM

My code was overcomplicated. No need to mask out the mantissa. Same result:

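for (int shift = 0; shift < 23; shift++)
{
	uint32_t bits;
	memcpy(&bits, &value, sizeof bits); // read the float's bit pattern
	bits &= 0xFFFFFFFFu << shift; // zero the lowest `shift` bits
	float reduced;
	memcpy(&reduced, &bits, sizeof reduced); // turn the bits back into a float
	ImGui::Text("PI reduced by %i bits: %f", shift, reduced);
}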

taby

1,509

Author

May 21, 2024 02:36 PM

I tried this, but it doesn't work. Any ideas what I'm doing wrong?

Edit: I updated the code… still doesn't work though. It only starts working like halfway through.

#include <cmath>
#include <cstdint>
#include <iostream>
#include <iomanip>
using namespace std;

int main(void)
{
	cout << setprecision(20) << endl;

	const double pi = 4.0 * atan(1.0);

	uint64_t mantissa_size = 52;

	for (uint64_t shift = 0; shift < mantissa_size; shift++)
	{
		uint64_t bits = (uint64_t&)pi;
		bits = bits & (4294967295 << shift);
		double reduced = (double&)bits;
		cout << shift << " " << reduced << endl;
	}

	return 0;
}
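A likely culprit in the code above is the mask. The literal 4294967295 is 0xFFFFFFFF, so only its low 32 bits are set. For small shifts, the AND therefore wipes the sign, the exponent, and the upper mantissa bits of the double. Only once shift reaches 32 does the mask cover the top of the word, which would match the "starts working halfway through" symptom. Below is a minimal sketch of a corrected version, assuming IEEE 754 doubles; the helper name `clear_low_mantissa_bits` is hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper: zero the `shift` lowest mantissa bits of a double.
double clear_low_mantissa_bits(double d, unsigned shift)
{
	assert(shift <= 52); // a double stores 52 explicit mantissa bits

	std::uint64_t bits;
	std::memcpy(&bits, &d, sizeof bits); // well-defined alternative to (uint64_t&)d

	// Start from all 64 bits set, then shift, so the sign and exponent survive.
	bits &= (~std::uint64_t{0}) << shift;

	double reduced;
	std::memcpy(&reduced, &bits, sizeof reduced);
	return reduced;
}
```

Calling this in the loop in place of the three lines in its body should give a value that shrinks gradually from pi (shift 0) toward 2.0 (shift 52, all mantissa bits cleared).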
