BACK TO BEHAVIOR

 

B.E.S.T. slave Training

classical & Operant conditioning

                                                                                

Classical Conditioning (respondent conditioning)

Response behavior is reflexes or automatic responses that are caused by stimuli. Response behavior can be changed (conditioned) through learning. 

Ivan Pavlov (1849 – 1936) studied the link between a stimulus and the corresponding response.  He said that when an organism comes to associate a neutral stimulus with a meaningful one, the organism then responds to the neutral stimulus as if it were the meaningful one.

Pavlov’s dog experiment – A dog learns to associate the sound of a bell (neutral stimulus) with food (meaningful stimulus) and begins responding to the bell as though it were food. The bell becomes a conditioned stimulus and produces a conditioned response.

In short, we learn to associate or link a stimulus to a response.

Classical conditioning is used by trainers for two purposes:

1) To condition (train) automatic responses without using the stimuli that would naturally create such a response; and,

2) to create an association between a stimulus that normally would not have any effect on the animal and a stimulus that would.

 

Operant Conditioning (instrumental conditioning)

  BACK TO BEHAVIOR 

B. F. Skinner (1904 – 1990) said that an organism learned that a particular behavior produced a particular response (consequence). If the response is useful or pleasurable the organism will tend to repeat the behavior to get the response. If the response is not useful or unpleasant the organism will tend not to repeat it.

In short, we learn to associate or link a behavior to a particular response that is produced by that behavior.

B. F. Skinner (1965) defined positive reinforcement as adding something positive to the situation and negative reinforcement as removing something negative (p.73).  He also observed that the two forms of punishment consist of adding a negative or taking away a positive (p.185).  Both sets of concepts are depicted in Table 3.  (Parents should have no trouble identifying them, respectively, as praising, slacking off, spanking, and grounding.)

Classical conditioning forms an association between two stimuli. Operant conditioning forms an association between a behavior and a consequence. (It is also called response-stimulus or RS conditioning because it forms an association between the animal's response [behavior] and the stimulus that follows [consequence])  Consequences have to be immediate, or clearly linked to the behavior. With verbal humans, we can explain the connection between the consequence and the behavior, even if they are separated in time.

General Principles

There are 4 major techniques or methods used in operant conditioning. They result from combining the two major purposes of operant conditioning (increasing or decreasing the probability that a specific behavior will occur in the future), the types of stimuli used (positive/pleasant or negative/aversive), and the action taken (adding or removing the stimulus).  Restated, the  four possible consequences to any behavior. They are:

1) Something Good can start or be presented, so behavior increases ;   Positive Reinforcement (R+)
2) Something Good can end or be taken away so behavior decreases;
Negative Punishment (P-)
3) Something Bad can start or be presented so behavior decreases;
Positive Punishment (P+)
4) Something Bad can end or be taken away, so behavior increases.
Negative Reinforcement (R-)

Technical Terms

The technical term for "something started" or "presented" is positive, since it's something that's added to the environment.

The technical term for "ended" or "taken away" is negative, since it's something that's subtracted from environment.

Anything that increases a behavior - makes it occur more frequently, makes it stronger, or makes it more likely to occur - is termed a reinforcer. Often, a person will perceive "starting Something Good" or "ending Something Bad" as something worth pursuing, and they will repeat the behaviors that seem to cause these consequences. These consequences will increase the behaviors that lead to them they are reinforcers. These are consequences the slave will work to attain, so they strengthen the behavior.

Anything that decreases a behavior - makes it occur less frequently, makes it weaker, or makes it less likely to occur - is termed a punisher. Often, a person will perceive "ending Something Good" or "starting Something Bad" as something worth avoiding, and they will not repeat the behaviors that seem to cause these consequences. These consequences will decrease the behaviors that lead to them they are punishers.

Positive Reinforcement  (getting something pleasant)

Positive reinforcement means starting or adding Something Good, something the slave likes or enjoys. Because the slave wants to gain that Good Thing again, it will repeat the behavior that seems to cause that consequence.  Good behavior will be rewarded.

Reinforcement doesn't work if:


the "reinforcer" is not reinforcing -- (e.g., a single reinforcer won't work for all slaves; can be a case of too much of a good thing,


the reinforcement is not consistent -- continuous, consistent reinforcement brings about more rapid behavior change than intermittent reinforcement; a little reinforcement early is better than a lot of reinforcement later


the individual loses too much, or gains too little, by changing a behavior -- people constantly do "cost/benefit analysis" kinds of evaluations, and the change in behavior has to be "worth it" in the mind of the person changing the behavior


too much is expected too soon -- shaping proceeds too rapidly

You reinforce behavior, there are 3 important conditions:

The reinforcer must follow the response.
The reinforcer must follow immediately.
The reinforcer must be contingent on the response.

Timing, magnitude, and consistency of reinforcement all can affect the rate at which new behaviors are learned.

Negative Punishment (being deprived of something pleasant)

Negative punishment is reducing behavior by taking away Something Good. If the slave  was enjoying or depending on Something Good she will work to avoid it getting taken away. she is less likely to repeat a behavior that results in the loss of a Good Thing. This type of consequence is a little harder to control.

Not all of what we do that is punished is physically painful. Often, the consequences of our behavior are that we loose something of value to us. This type of punishment is especially common when we look at the sanctions that society places on our behavior that it deems inappropriate.  (speeding ticket -- loss of money and maybe loss of driver's license if it continues)

Positive Punishment (receiving something unpleasant)

Sometimes we do things that cause us to experience pain, reducing the probability of us doing that again. You put your hand on a hot stove, and you get burned.

Positive punishment is something that is applied to reduce a behavior (spanking). The term "positive" often confuses people, because in common terms "positive" means something good, upbeat, happy, pleasant, rewarding. Remember, this is technical terminology we're using, though, so here "positive" means "added" or "started". Also keep in mind that in these terms, it is not the slave  that is "punished" (treated badly to pay for some moral wrong), but the behavior that is punished (reduced). Positive punishment, when applied correctly, is the most effective way to stop unwanted behaviors. Its main flaw is that it does not teach specific alternative behaviors.

Because a positive punisher, like other consequences, must follow a behavior immediately or be clearly connected to the behavior to be effective, a secondary positive punisher is very important.

Negative Reinforcement (taking away or avoiding something unpleasant)

Negative reinforcement increases a behavior by ending or taking away Something Bad or aversive. By making the animal's circumstances better, you are rewarding it and increasing the likelihood that it will repeat the behavior that was occurring when you ended the Bad Thing.

In order to use negative reinforcement, the trainer must be able to control the Bad Thing that is being taken away. This often means that the trainer must also apply the Bad Thing. And applying a Bad Thing might reduce whatever behavior was going on when the Bad Thing was applied.

It is divided into two categories; escape and avoidance.

Escape something unpleasant is currently going on, and the organism does something to terminate or lessen it.

Avoidance
has occurred when behavior has prevented the onset of an impending increase in the aversiveness of the environment. They have in common that both are instances of behavior leading to an improvement in the environment (hence reinforcement) by removing some amount of the aversiveness of the environment (therefore negative). As is always the case with reinforcement, life is better for the behavior having occurred, therefore increasing the probability of the behavior in the future under similar circumstances.
 

 

EFFECTS OF BEHAVIOR

SYMBOLS

TECHNICAL TERMS

Adds Positive Conditions

Subtracts Negative Conditions

(+ +)

(-  -)

Positive Reinforcement

Negative Reinforcement

Adds Negative Conditions

Subtracts Positive Conditions

(+ -)

(- +)

Positive Punishment

Negative Punishment

The table above is read as follows:

  1. If the effects of behavior add positive conditions, that is a plus-plus, and the technical term is positive reinforcement.
  2. If the effects of behavior subtract negative conditions, that is a minus-minus, and the technical term is negative reinforcement.
  3. If the effects of behavior add negative conditions, that is  a plus-minus, and the technical term is positive punishment.
  4. If the effects of behavior subtract positive conditions, that is a minus-plus, and the technical term is negative punishment.

or:

 

Reinforcement
 

(behavior increases)

Punishment
(behavior decreases)
Positive
(something added)
Positive Reinforcement:
Something added increases behavior
Positive Punishment
Something added decreases behavior
Negative
(something removed)
Negative Reinforcement
Something removed increases behavior
Negative Punishment
Something removed decreases behavior

 

Analyzing Examples of Operant Conditioning

There are five basic processes in operant conditioning: positive and negative reinforcement strengthen behavior; punishment, response cost, and extinction weaken behavior.

 1) Positive Reinforcement--the term reinforcement always indicates a process that strengthens a behavior; the word positive has two cues associated with it. First, a positive or pleasant stimulus is used in the process, and second, the reinforcer is added (i.e., "positive" as in + sign for addition). In positive reinforcement, a positive reinforcer is added after a response and increases the frequency of the response.

2) Negative Reinforcement-- the term reinforcement always indicates a process that strengthens a behavior; the word negative has two cues associated with it. First, a negative or aversive stimulus is used in the process, and second, the reinforcer is subtracted (i.e., "negative" as in a "-" sign for subtraction). In negative reinforcement, after the response the negative reinforcer is removed which increases the frequency of the response. (Note: There are two types of negative reinforcement: escape and avoidance. In general, the learner must first learn to escape before he or she learns to avoid.)

3) Response Cost--if positive reinforcement strengthens a response by adding a positive stimulus, then response cost has to weaken a behavior by subtracting a positive stimulus. After the response the positive reinforcer is removed which weakens the frequency of the response.

4) Punishment--if negative reinforcement strengthens a behavior by subtracting a negative stimulus, than punishment has to weaken a behavior by adding a negative stimulus. After a response a negative or aversive stimulus is added which weakens the frequency of the response.

5) Extinction--No longer reinforcing a previously reinforced response (using either positive or negative reinforcement) results in the weakening of the frequency of the response.

The following questions can help in determining whether operant conditioning has occurred.

a. What behavior in the example was increased or decreased?

b. Was the behavior increased (if yes, the process has the be either positive or negative reinforcement), or decreased (if the behavior was decreased the process is either response cost or punishment).

c. What was the consequence / stimulus that followed the behavior?

d. Was the consequence / stimulus added or removed? If added the process was either positive reinforcement or punishment. If it was subtracted, the process was either negative reinforcement or response cost.


People will move toward new and different behaviors if they view these new and different behaviors as:

  • adding new positive conditions,
  • preserving existing positive conditions,
  • avoiding new negative conditions, and
  • eliminating existing negative conditions.

Typically, people will shy away from new and different behaviors if they view these new and different behaviors as:

  • adding new negative conditions,
  • preserving existing negative conditions,
  • avoiding new positive conditions, and
  • eliminating existing positive conditions.

 How to stop an bad behavior:

Three methods: extinction, differential reinforcement of other behaviors, and reinforcement of other behaviors.


extinction -- making sure that a particular response no longer leads to reinforcement (e.g., getting  attention through bas behavior


differential reinforcement of other behaviors -- called DRO -- is a procedure whereby an organism is reinforced for not exhibiting a particular behavior during a specified time interval reinforcing incompatible responses {not as widely followed these days?} (e.g.,   Master praises a slave who behaves properly)


reinforcing incompatible behaviors -- reinforcement of a behavior incompatible with the undesired response

punishment --

(1) presentation of an aversive stimulus (e.g., spanking) and

(2) removal of a pleasant stimulus (e.g., not being allowed to watch TV). There is a lot of argument about whether punishment works or not. Punishment only suppresses the undesired behavior, so it's best coupled with an alternative positive behavior which you hope will replace the undesired behavior.

Differences between the two

The distinction between Classical behavior and operant behavior is that respondent behavior is caused by stimuli where as operant behavior is emitted or freely made by the individual.

While the differences at first blush seem profound, we will eventually see that Pavlovian and operant are two halves of an integrated view of learned behavior--Pavlovian by itself isn't of much value--it sets us up to act, and the action is the operant part of our behavior. By the same token, operant doesn't operate in a vacuum--we need to be motivated to act.

The nature of reinforcement is also different. For respondent behavior the stimulus is the reinforcement. For operant behavior, the effect caused by the behavior is the reinforcement.

Classical conditioning is referred to as stimulus -- response. 

Operant conditioning is referred to as response -- stimulus.

 BACK TO BEHAVIOR  

 

Factors that influence operant conditioning

Julian Rotter stated that behavior modification requires more than a standard set of classical or operant conditioning techniques. He said that individual differences are important in behavioral training. The individual’s thoughts and emotions play a part in behavioral training.  He stated that behavioral potential, expectancy, reinforcement value and psychological situations are factors that should be considered and are measurements for success. 

Behavioral potential – The likelihood that a particular behavior will occur in a given situation.

Expectancy – The slave's expectations about the outcome.

Reinforcement Value – The importance of the reinforcement to the slave.

The Psychological Situation – The definition or perspective of the slave about a behavioral training step.

He also stated that human's behavior is always directional (goal oriented) and determined by needs that behavior can be inferred by the way the individual interacts with the environment.

Through research, Rotter identified six psychological needs:

Recognition-status: the need to be seen as competent in socially valued activities.

Dominance: the need to control the actions of others.

Independence: the need to make one’s own decisions and rely on one's self.

Protection-dependency: the need to have others prevent frustrations or help obtain goals.

Love and affection: the need for acceptance and liking by others.

Physical comfort: learned needs for physical satisfactions associated with security.

Understanding the slave's psychological needs is useful in behavioral training.

Each of the above needs has three basic components.

1) The first component is need potential or the likelihood that that a set of behaviors directed at a goal will be used in a given situation.

2) The second component, freedom of movement, is the expectancy that a set of behaviors will lead to success in meeting a goal. High expectance leads to the anticipation of success.

3) The third component, need value, is the importance an individual places on one goal over other goals.

The concept of minimum goal level refers to the lowest level of potential reinforcement that is perceived as satisfactory in a particular situation. This varies from individual to individual. It is the point where the individual feels positive reinforcement enough to complete the goal.

 BACK TO BEHAVIOR