I challenge anyone to try building a C compiler without a big suite of tests. Zig is the most recent attempt and they had an extensive test suite. I don't see how that is disqualifying.
If you're testing a model I think it's reasonable that "clean room" have an exception for the model itself. They kept it offline and gave it a sandbox to avoid letting it find the answers for itself.
Yes the compression and storage happened during the training. Before it still didn't work; now it does much better.
The point is - for a NEW project, no one has an extensive test suite. And if an extensive test suite exists, it's probably because the product that uses it also exists, already.
If it could translate the C++ standard INTO an extensive test suite that actually captures most corner cases, and doesn't generate false positives - again, without internet access and without using gcc as an oracle, etc?
I didn't personally experience it (I was too young), but I think that was part of "the mission" since pre-9/11. The point of the ID check is to make sure the boarding ticket and ID match.
whatever it is, I can't remember the last time something like this took the internet by storm. It must be a neat feeling being the creator and watching your project blow up. Just in a couple weeks the project has gained almost 100k new github stars! Although to be fair, a ton of new AI systems have been upsetting the github stars ecosystem, it seems - rarely actually AI projects, though, seems to just be the actual systems for building with AI?
Ah yes, and convincing friends/family/partners to use Signal instead of Whatsapp clearly what will convince them is that they need to setup, acquire, and use cryptocurrency to register or connect with me on the encrypted messaging service. "No thanks, I just use Whatsapp/iMessage. I heard they're actually e2e encrypted too, so what's the problem?"
Management then cared that their one chance would work. Today management just wants it to mostly work.
Incentives and goals are very different between the two. We could very much build even more incredible things today; and would argue that we actually do. Just only in the places that seem to matter enough to do that type of special effort for.
Unfortunately it simply is true. The "same car" in another country is made cheaper. There are a variety of ways to do it and often it's justified as better than the alternative. For instance in India the goal is getting people off of motorcycles as that is a huge cause of driving deaths. To do that they remove various airbag systems, auto braking systems, and etc. which are not required in India... but are required in America or the EU.
Even between the EU and America there are differences in regulation with the EU often getting the stronger regulation first.
I get it, but part of the reason they're on motorcycles is cars are too expensive for them. Would making it safer and costing $20-30k base like it does in America be more or less ethical?
I don't think there is a clear answer. Safer car means owners of the car survive more and more often: which is unequivocally great. Making it cheaper means there are more people in a safer vehicle than before which can reduce their likelihood of getting hurt/dying, but they're not as protected as they could be. This is, on the whole, also really good.
It just so happens car makers pick to make it cheaper. It's good for business and good for those who couldn't afford it before. It is bad for those who encounter the situations where additional protection would have saved them.
If you're testing a model I think it's reasonable that "clean room" have an exception for the model itself. They kept it offline and gave it a sandbox to avoid letting it find the answers for itself.
Yes the compression and storage happened during the training. Before it still didn't work; now it does much better.