[FRIAM] Alignment (with what?)

steve smith sasmyth at swcp.com
Wed Jun 4 12:27:17 EDT 2025


On 6/4/25 9:43 AM, Marcus Daniels wrote:
>
> Really?   The kind of biases I don’t like seem to be system prompting 
> preferences.   For example, when an idea is marginal, ChatGPT will 
> kick the can forward to keep the conversation going.  Perhaps the 
> instinct is to create a demand signal or “billable hours”.  Gemini Pro 
> will tend to be cautious.    To use them effectively one needs to have 
> some self-discipline.  I don’t see these tools doing obvious motivated 
> reasoning like people do.   ChatGPT will especially follow your lead, 
> which can be a bad lead; it is a people pleaser.
>
I agree that the chatbot interface presented to GPT (and others possibly 
less acutely) is a sycophant biased to both please humans and encourage 
our continued engagement.
>
>
> My latest “wow” moment:   I was checking up on the Kerch bridge attack 
> and took a few frames from the video and pasted them for ChatGPT.  I 
> asked if the explosion suggested poor placement by Ukrainian frogmen.  
> It explained that blast off to the side would be typical and suggested 
> an attempt to displace the pier laterally deep under the water.  It 
> did not imply an attempt to knock the bridge down but rather to make 
> it unsafe to use, while also leading to incremental structural failure.
>
So to say that it reflected the (accurate?) bias of such an operation to 
try to maximize effect by "satisficing" by focusing on "making it 
unsafe" rather than wasting tactical resources on more complete 
destruction?

I suppose it would be too much to ask for it to cause distortions in the 
bridge which selectively modify it to become useful only for Ukrainian 
goals vs Russian or perhaps civilian vs military? By derating it's 
functionality marginally perhaps it can handle medium-load commercial 
transport but risk failure under heavier military (e.g.moving tanks) use?


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://redfish.com/pipermail/friam_redfish.com/attachments/20250604/e6450731/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: OpenPGP_0xD5BAF94F88AFFA63.asc
Type: application/pgp-keys
Size: 3118 bytes
Desc: OpenPGP public key
URL: <http://redfish.com/pipermail/friam_redfish.com/attachments/20250604/e6450731/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: OpenPGP_signature.asc
Type: application/pgp-signature
Size: 840 bytes
Desc: OpenPGP digital signature
URL: <http://redfish.com/pipermail/friam_redfish.com/attachments/20250604/e6450731/attachment.sig>


More information about the Friam mailing list