Brenda, your writing is as useful and illuminating as ever! Thanks for your definitions and explanations. This helps me out a lot in a funny way.
Karen asked me to contribute to this thread, because she knows that I used to expound the "variable reinforcement schedule". I was just going to pop on over here and say.. oh well.. I'm not really a good person to ask because I gave up using it and since I did, training is going so much better for Sunrise and me".
And that brought up the question for me, of WHY, if variable reinforcement is so effective, (as I believe in theory it must be) training has improved since dispensing with it. Curious!
Now, reading your explanations, I realise I haven't stopped using it at all, I'm just using it in a different way, which I'm obviously better at, and am integrating variability into our training without even being aware of it. Wow! Cool!
Quote:
So if I keep clicking stride, stride, HOP too many times, it will be more and more difficult ot extinguish that step in the future and get say 3 or 4 strides or 20, etc
Yes yes! This was one of the problems I touched on in my diary last night. In teaching ramener, because my imagination at that time wouldn't furnish me with alternative things to teach to lead to the whole, I dwelled too long on reinforcing her first incorrect attempts. Now it's very difficult to extinguish.
Other tasks that I've taught since have progressed much better..and I know what it is that I am doing better.. I just hadn't made the connection and realised that this IS variable reinforcement.
Brilliant! So now, I still capture and reinforce the first incorrect attempts to give her encouragement.. but then I ask for something more, or different.. so the reinforcement comes slower. And I can throw in some simple secondary task to up the rate of reinforcement again if she becomes frustrated or lost.
Thanks for enlightenment!
I wonder though if it was this that Karen was asking about, or the method of VR used in Bridge and Target training, which takes the same principle and applies is differently.
It is this that I don't use any more.
This method is to only reinforce a certain percent of the Terminal Bridges.. (the "clicks".), and to do it randomly, so that the horse will not know which click (or other TB ) will lead to a reinforcer. You still use encouragers, or Intermediate Bridge in the same way.
I know that this method works, as many people are successfully using it. I am not one of them.
I found that it was just too confusing for me and the horse.. and unneccessary, because I have IB or keep going signals that can create a similar effect without the confusion.
For example, in BandT.. you might
Cue Click
Cue Click
Cue Click Treat
Cue Click
Cue Click Treat
Cue Click
Cue Click
Cue click
Cue Click Treat
Cu Click
Cue Click treat.
In CT you can
Cue good good
Cue Click treat
Cue good good good
Cue good good
Cue Click treat
Etc.
In B and T you are supposed to begin with the random reinforcement right at the start of training, as soon as your horse has connected the bridge with the reward.
I was struggling with a lot of things when I was trying to use this method, so maybe it was not a very fair trial.. I'm sure not! B and T is not so easy to learn as there's little information out there being shared.. and unless you can get to a clinic, I think much is a mystery.
It did initially cause her to try extra hard to get the "right" click, but I found that it lead to a lot of frustration as well (probably from my own ineptness in using it,)and I did find that Sunrise seemed to be doubting the bridge signal, and often choosing to do her own thing rather than hang about for carrots which she loves.
Immediately that I gave it up, and started reinforcing wildly (something I picked up from watching your video of paint drying Karen.
) I suddenly found far greater enthusiasm and clarity myself, and Sunrise LOVED it.. and we've never looked back.
And since then I've been puzzling over why I've got a so much better motivated horse when in theory she should be complacent..
Thanks Brenda for some food for thought.
Karen, you said:
Quote:
To this point, I still basically reward in the exact same way and nearly the exact same frequency with any given behavior as I do when it's at it's very beginning stages. I may have more duration, but the rewards come on a very regular and expected basis.
I'm wondering, if you have more duration, how can the rewards be coming at at the same frequency?
Are you not able to just add in some good good goods between clicks, and slowly dispense with some of the clicks for old well learned and everyday behaviours?
For example, you must have clicked and treated each step when you first asked Tam to walk with you, but now I'm sure you don't click every step.. you're asking for something else and clicking that.
I think this question you're asking is one a few of us are struggling with at the moment. When and how do we start just expecting our horses can perform certain things without needing continual reinforcement.. move on to adolescence and adulthood.
Did you see that nice little video clip that I found and posted a while ago of a clicker trainer working with a well trained horse? It was exactly this they were trying to address.
Cheers,
Sueh
_________________
I have not sought the horse of bits, bridles, saddles and shackles,
But the horse of the wind, the horse of freedom, the horse of the dream. [Robert Vavra]