This is incorrect as was shown last year with the Skill-Mix research:
Furthermore, simple probability calculations indicate that GPT-4’s reasonable performance on k=5 is suggestive of going beyond “stochastic parrot” behavior (Bender et al., 2021), i.e., it combines skills in ways that it had not seen during training.
This seems like it may be at the provider level and not at the actual open weights level: https://x.com/xlr8harder/status/1883429991477915803
So a “this Chinese company hosting a model in China is complying with Chinese censorship” and not “this language model is inherently complying with Chinese censorship.”