loadrunner correlation

16
Correlation? What’s that? If you think correlation has something to do with the fit of data points to a function curve on a graph, and the word has no meaning to you in the context of Mercury’s tool, then this document is for you. It explains what correlation is, why you have to do it, how to do it, and what to do when it goes wrong. If this is the first time that you have used VuGen or if you have been using it a little but are not a guru, then read on. Introduction VuGen, when recording a script, simply listens to the client (browser) talking to the server (Web server) and writes it all down. The complete transcript of everything that was said, the dates and time, content, requests, and replies can be found in the Recording Log (View Output Window Recording Log). The script is an easier to read version of this. The main difference is that the script only contains the client’s communication. If you imagine that VuGen is an impersonator pretending to be the client (browser), the script tells VuGen what to say to the server to successfully fool it. You want the server to believe that VuGen is a real client, and thus send it the requested information. This script has the hard coded information of the original conversation (Browser session) that occurred between the client and server. This hard coded information may not be enough to fool the server during replay however; it may have to be correlated.

Upload: eveningorbit

Post on 15-Nov-2014

55 views

Category:

Documents


5 download

DESCRIPTION

LoadRunner Correlation Explained

TRANSCRIPT

Page 1: LoadRunner Correlation

Correlation? What’s that? If you think correlation has something to do with the fit of data points to a function curve on a graph, and the word has no meaning to you in the context of Mercury’s tool, then this document is for you. It explains what correlation is, why you have to do it, how to do it, and what to do when it goes wrong. If this is the first time that you have used VuGen or if you have been using it a little but are not a guru, then read on. Introduction VuGen, when recording a script, simply listens to the client (browser) talking to the server (Web server) and writes it all down. The complete transcript of everything that was said, the dates and time, content, requests, and replies can be found in the Recording Log (View � Output Window � Recording Log). The script is an easier to read version of this. The main difference is that the script only contains the client’s communication. If you imagine that VuGen is an impersonator pretending to be the client (browser), the script tells VuGen what to say to the server to successfully fool it. You want the server to believe that VuGen is a real client, and thus send it the requested information. This script has the hard coded information of the original conversation (Browser session) that occurred between the client and server. This hard coded information may not be enough to fool the server during replay however; it may have to be correlated.

Page 2: LoadRunner Correlation

What is correlation? Correlation is where the script is modified so that some of the hard coded values in the script are no longer hard coded. Rather then sending the original value to the server, we may need to send different values. For example, the original recorded script may have included the server sending the client a session identification number, something to identify the client during that particular session. This Session ID was hard coded into the script during recording. During replay, the server will send the client a new Session ID. You need to capture this value, and incorporate it into the script so you can send it back to the server to correctly identify yourself for this new session. If you leave the script unmodified, you will send the old hard coded Session ID to the server. The server will look at it and think it invalid, or unknown, and so will not send the pages that have been requested. Replaying script with old will Session ID not have successfully fooled the server into believing it is a client. Correlation is the capturing of dynamic values passed from the server to the client and back. You save this captured value into a parameter, and then use this parameter in the script in place of the original value. During replay, the replay engine will now listen to what the server sends to it, and when it makes requests of the server, send this new, valid value back to the server; thus fooling the server into believing it is talking to a real client. Why do I have to correlate? If you try to replay a script without correlating first, then most likely the script will fail. The requests it sends to the server will not be replied to. Either the Session ID is invalid so the server will not allow you into the site, or it will not allow you to create new records because they are the same as existing ones, or the server will not understand your request because it is not what it is expecting. Any value that changes every time you connect to the server is a candidate for correlation. A correlated script will send the server the information it is looking for, and so allow the script to replay. This will allow many Vusers to replay the script many times, placing load on your server.

Page 3: LoadRunner Correlation

What errors mean I have to correlate? There are no specific errors that are associated with correlation, but there are errors that could be caused because a value has not been correlated. For example, if an invalid Session ID is sent to a Web server, how that server responds depends on the implementation of that server. It might send a page specifically stating the Session ID is invalid and ask you to log in again. It might send an HTTP 404 Page not found error because the requesting user did not have permissions for the specified page, and so the server could not find the page. In general, any error message returned from the server after the script makes a request that complains about permissions can point to a hard coded value that needs to be correlated. The tools (functions) used to correlate There are 3 functions that you can use for correlation. A list of them, along with documentation and examples, can be found in the on-line documentation. From VuGen, go to Help � Function reference � Contents � Web and Wireless Vuser Functions � Correlation Functions. The first function, web_reg_save_param, is the latest function with some extra features for a more powerful usage. The second and third function, web_create_html_param and web_create_html_param _ex are for backward compatibility. They are essentially the same, and will be talked about together. The last function is associated with the second and third function, and is not directly a correlation function but rather a setting. It will be talked about later in a different section. Web_reg_save_param web_reg_save_param ( “Parameter Name” , <List of Attributes>, LAST ); Each of these parameters is a pointer to a string. That means that if they are entered as literal text, they need to be enclosed in quotes, with each parameter separated by a comma. The supported attributes include LB (Left boundary), RB (Right boundary), RelFrameID, ORD, Search, SaveOffset, SaveLen, and Convert. These attributes can appear in any order because they contain within them what they are. Detail information about each attribute can be found on the function reference online.

Page 4: LoadRunner Correlation

Web_create_html_param web_create_html_param ( “Parameter Name”, “Left Boundary”, “Right Boundary” ); This function is for backward compatibility only. It was a standard for LoadRunner6.5 and below. This function takes three parameters. Each of these parameters is a pointer to a string. That means that if they are entered as literal text, they need to be enclosed in quotes, with each parameter separated by a comma. Parameter Name - This is the name of the parameter, or placeholder variable for saving the captured value. After successfully capturing the value, the parameter name is used in the script in place of the original value. The replay engine will identify the parameter/placeholder, and substitute the captured value for the placeholder during replay. This name should have no spaces, but apart from that limitation, it is entirely up to you what name to give it. Left Boundary - This is where you tell the replay engine how to find the dynamic value you are looking for. In the Left Boundary, you specify the text that will appear to the left of the changing value. Right Boundary - This is where you tell the replay engine how to identify the end of the dynamic value you are looking for. You use the text that will appear after the value. Web_create_html_param_ex web_create_html_param_ex ( “Parameter Name”, “Left Boundary”, “Right Boundary”, “Instance”); This function is the same as the web_create_html_param function, except it does not look for the first instance of the boundaries, but rather the nth instance of those boundaries. The first three parameters (Parameter Name, Left Boundary, and Right Boundary) are the same, the last parameter is a pointer to a string, and so it must be enclosed in double quotes. It is the number of the occurrence. If you place the number 1 here, the function behaves exactly as the web_create_html_param function; it looks for the first occurrence. If you put the number 3 here, it will look for the third occurrence of the left and right boundaries, and place what appears in-between into the parameter.

Page 5: LoadRunner Correlation

Identifying values to correlate So you have the tools, and you know why you need to use them, but how do you know what to use them on? What values in the script need to be correlated? The simplest answer is “Any value that changes between sessions required for the script to replay.” A hypothetical example You are logging onto a website. When you send the server your user name and password, it replies with a Session ID that is good for that session. The Session ID needs to be correlated for replay. You need to capture this value during replay to use in the script in place of the hard coded value. To identify values to correlate, record the script, and save it. Open a new script, and record the same actions and business process again. As much as possible, during recording, enter the same values in both scripts, for example, user ID, password, and fields and edit selections. Save the second script, and run it with Extended log (Vuser � Run-Time Settings � Log � Extended log. Check all three options). Go to Tools → Compare with Vuser, and choose the first recorded script. WinDiff will open and display the two scripts side by side. Lines with differences in them will be highlighted in yellow. Differences within the line will be in red (If not, go to Options � View � Show inline differences.)

Page 6: LoadRunner Correlation

If WinDiff gives an error here, dismiss the error. WinDiff will be minimized in the task bar, right-click on it and choose “Restore.” Go to File → Select Files/Directories and manually select the action sections for the two scripts. Differences like "lr_think_time" can be ignored. They are pacing functions, and do not represent data sent to the server. Locate the first difference, take note of it, and search the script open in VuGen for that difference. That is the original value hard coded into the script that was different in the second script. Highlight it and copy it.

Go to the recording log and place your cursor at the top. Press Control-F (CTRL+F) to do a search and paste in the original value. You are looking for the first occurrence of this value in the recording log. If you do not find the value in the recording log, verify you are looking in the correct script’s recording log. Remember you have two almost identical scripts here. If you find the value, scroll up in the log, and make sure the value was sent as part of a response from the server. The first header you come across while looking up the script should be prefaced by a receiving response. This indicates that the server sent the value to the client. If the value first appears as part of a sending request, then the value originated on the client side and does not need to be correlated, but rather parameterized.

Page 7: LoadRunner Correlation

That is a different topic all together. The response will have a comment before it that looks like this:

*** [tid=b9 Action1 2] Receiving response from host astra.merc-int.com:80 ( 25/11/2002 12:04:00 )

So, you have a value that is different between subsequent recordings, it was sent from the server to the client; this value most likely needs to be correlated. If the value is not different between recordings and not originating on the server and sent to the client then the value probably does not need to be correlated.

Page 8: LoadRunner Correlation

How to correlate Step 1 After confirming that the first occurrence was part of a received response from the server, you need to figure out where to place the web_reg_save_param( ) function. The web_reg_save_param statement needs to go immediately before the request that fetched the dynamic value from the server. In order to find this request or URL in the script, you need to replay the script once with extended log and all the three options (In Vuser → Runtime Settings → Log) turned on. In the Recording log, pick up the text that is before the dynamic value. This text should remain constant no matter how many times you replay the script and highlight it and copy it. This is the text that will identify where to find the start of the value you are capturing.

Page 9: LoadRunner Correlation

Now, go to the Execution log and search for the text that you just copied from the Recording log. You should see a corresponding Action1.c() at the beginning of that line with a number in the brackets. That is the number of the line in the script where you need to put web_reg_save_param( ) function. The function should go right above that line in the script.

So, add a couple of blank lines to your script before the function at that line, and then type in web_reg_save_param(“UserSession”, but give it a name that means more to you than UserSession.

Page 10: LoadRunner Correlation

Step 2 Go back to the recording log, highlight the text to the left of the dynamic value and copy it. This should be some of the same text you searched for in the Execution log. The amount of text you highlight should be sufficient so that it is unique in this reply from the server. It is suggested that you copy as much as possible without copying any special characters. These show in the Execution log as black squares, and the actual character they represent is uncertain. After selecting a boundary, go to the top of the Servers reply, and press CTRL+F and do a search for that boundary. You want to make certain what you have selected is the first occurrence in the Servers reply. If it is not select, use more text to make it unique, or consider using the ORD parameter of the web_reg_save_param function. Once you have finalized the static text that represents the left boundary, copy it into the web_reg_save_param (or web_create_html[_ex]) statement. If it contains any carriage returns, place it all on one line. If there are any quotes (“) in the text, place the escape character before it so that the replay does not incorrectly think it is the end of the parameter, but rather a character to search for. For example, if the left boundary was 'input type=hidden name=userSession value=' (without the single quotes) and you are using the web_reg_save_param statement, then the function you have so far would be web_reg_save_param (“UserSession”, “LB=input type=hidden name=userSession value=”,

Page 11: LoadRunner Correlation

Step 3 You are now going to tell the replay engine how to identify the end of the value you are trying to capture, that is, the right boundary of what you are looking for. Again, look in the Execution log and copy the static text that appears to the right of the dynamic value you are looking for. For example, let’s say the Execution log contained the following: … userSession value=75893.0884568651DQADHfApHDHfcDtccpfAttcf>… Then, the example so far to save the number into the parameter UserSession would be web_reg_save_param(“UserSession”, “LB=input type=hidden name=userSession value=”, “RB=>”,LAST);

In choosing a right boundary, make sure you choose enough static text to specify the end of the value. If the boundary you specify appears in the value that you are trying to capture, then you will not capture the whole value.

Page 12: LoadRunner Correlation

Recap That was a lot of looking through the Recording and Execution logs and checking of values. Let’s just recap what you have done. You have identified a value that you think needs to be correlated. You then identified in the script where to place the statement that would ultimately capture and save the value into a parameter. You then placed the statement, and gave the text strings that appear on either side of the value that you are looking for so that it can be found. The flow of logic for this is the correlation functions tell the replay engine, what to look for in the next set of replies from the server. The replay engine makes a request of the server. The server replies. The replay engine looks thorough the replies for the left and right boundaries. If it finds them, what is in-between is then saved to a parameter of the name specified. Remember, the parameter cannot have a value until after the next statement is executed. The correlation statement only tells the replay what to look for; it does not assign a value to the parameter. Assignment of a value to the parameter does not happen until after generating request to the server and looks in the reply. If you have a case where a correlation statement is followed by a function that attempts to use the parameter, the statement is in the wrong place, and the script will fail. This is always incorrect: web_reg_save_param(…); Web_submit_data (… {Parameter}…); In-between the two, there needs to be a request of the server that causes it to reply with the value you are trying to capture.

Page 13: LoadRunner Correlation

Replacing the hard coded value in the script with the parameter Once you have created the parameter, the next step is to replace the hard coded occurrences with the parameter. Look through the script for the original value. Where you find it, delete the value and replace it with the parameter. Note, only the value you want replaces is deleted. The characters around it remain. EXAMPLE: Change:

… "Name=userSession", "Value=75893.0884568651DQADHfApHDHfcDtccpfAttcf", , ..... …

To:

… "Name=userSession", "Value={UserSession}", ENDITEM, …

At this point, you are ready to run the script to test if it works, or, if it needs further correlation, more work on this correlation.

Page 14: LoadRunner Correlation

Common errors when correlating When the replay fails to find the boundaries for a web_reg_save_param statement, it will print a warning message in the execution log like this: Action1.c(x): Error: No match found for the requested parameter "Parameter Name". If the data you want to save exceeds 256 bytes, use web_set_max_html_param_len to increase the parameter size Action1.c(x): web_url highest severity level was "ERROR", 47489 body bytes, 1518 header bytes By default, web_reg_save_param function issues an error if the boundaries cannot be found. If you need to use this function for other purposes, and would like to avoid errors, use the "Notfound=warning" attribute so that the replay will just issue a warning. Now, the replay actually suggested 2 possible causes for the error,

1. The boundaries defined cannot be found 2. The data you want to save exceeds 256 bytes

The advice given are just recommendation that you must take into consideration. Was the value you were trying to capture more than 256 characters long? In the above example it was only 40 characters long. Have a look at the Recording log and see how long the original value is. Have a look at the second recording made earlier and see how long it was in that script. Turn on the extended log (Run-Time Settings → Log → Extended log → All data returned from server) and have a look at how long it is in the Execution log. If all occurrences of the script were much shorter that the maximum parameter length, then the problem is either with the web_reg_save_param in the wrong place, or that the boundaries are incorrect. Go back and look at the boundaries that you selected, look at the placement of the web_reg_save_param. Is it immediately before the statement that causes the server to reply with the data you are looking for? If, at any time, any of these values was close to being around 200 characters, or exceeded 200 characters, then add a web_set_max_html_param_len statement to the start of the script to make the maximum longer than 256 characters. For example,

web_set_max_html_param_len ( “length” ); This statement tells the replay engine to look for larger matches for the left and right parameter. When it finds the left boundary, it will look ahead the max parameter length for the right boundary. This setting is script wide and takes

Page 15: LoadRunner Correlation

effect from when it is executed; it only needs appear in the script once. Looking for longer matches uses up more memory and CPU processing to search through the returned text from the server. For this reason, do not set it too high, or you will be making your script less scalable. That is you will reduce the number of Vusers that can run it on a given machine. Try to have the maximum parameter length no more than 100 characters greater that what you are expecting. Special cases for the boundaries: There are some special characters and cases when specifying the boundaries. Double quotes should be preceded by a “\” so that it can be recognizes as part of the string to look for. If your text includes any carriage returns, that are part of the http, and not just part of the wrap around in the recording log, these need to be specified as a “\r\n” character set. If the “\” character is part of the text, it too needs to be preceded by a “\” to indicate it is a literal.

Recording log Left boundary Right boundary Value=”57685” “Value=\”” “\”” Value_”\item\”value’7875’ “Value_\”\\item\\\”value’

“’”

Value= “7898756”

“Value=\r\n\”” “\””

Debug help Sometimes you want to print out the value that was assigned to a parameter. To do this, use the lr_eval_string function and the lr_output_message function. For example, to print the value of the parameter to the Execution log: lr_output_message(“ Value Captured = %s", lr_eval_string("{ParameterName}")); If you find that the value being substituted is too long, too short, or completely wrong, printing out the value will help identify the changes you need to make to the correlation function. If you have extra characters at the start of the value, you need to add them to the end of the left boundary, if you have extra characters at the end of the parameter value, you need to add them to the start of the right boundary. If you are getting the wrong value all together, do searches in the Recording log for the left boundary, make sure that you have a unique boundary and that LoadRunner is not picking up an earlier occurrence. You can then use the ORD=x attribute to specify which occurrence you want to capture if the boundary is not unique.

Page 16: LoadRunner Correlation

Other Correlation help resources The examples in the Function reference contain a lot of data and examples on how to use these functions. I would recommend looking over them. The Customer Support site has a video for download that goes over correlation. You can get it from http://support.merc-int.com. After logging in, go to Downloads → Browse. Select the Mercury Interactive downloads radio button, choose “LoadRunner” from the product selection drop-down box, and click on the “Retrieve” button. Under “Training,” select the “LoadRunner Web Script Correlation Training” link.