Data Profiling Using Base SAS Software: A Quick Approach to Understanding Your Data

Public Domain


Details

  • Description:
    "Data Profiling is the use of analytical techniques about data for the purpose of developing a thorough knowledge of its content, structure and quality" (http://www.bitpipe.com). Although this terminology is most often associated with data warehousing and high-level business intelligence efforts, the techniques are valuable tools for the everyday data manager or data analyst. SAS Version 9 software offers various avenues for performing data profiling, such as SAS/ETL and SAS Data Quality Solution. These tools, however, may not be available to some SAS users, may require additional training, and may be overkill if an understanding of the content of a file is all that is needed; that is, if no data cleansing or other transformations are required. This paper discusses an application that uses only Base SAS software and provides basic statistics, frequencies, ranges, outliers, and structural information for each variable in a table. The result of the application is a condensed report detailing the content of a data file. The application was written in the Windows environment and can be run from the SAS Display Manager. For those who have SAS/IntrNet software, a front end is also available to provide a user-friendly interface. Enhancements currently under development include running the application from SAS Enterprise Guide as a stored process. [Description provided by NIOSH]
  • Pages in Document:
    1-5
  • NIOSHTIC Number:
    nn:20030855
  • Citation:
    Proceedings of the 31st Annual SAS Users Group International Conference, March 26-29, 2006, San Francisco, California. Cary, NC: SAS Institute Inc, Paper 161-31, 2006 Mar:1-5
  • Contact Point Address:
    Susan J. Nowlin, IT Specialist, National Institute for Occupational Safety and Health, 4676 Columbia Parkway, MS-R4, Cincinnati, OH 45226
  • Email:
    snowlin@cdc.gov
  • Federal Fiscal Year:
    2006
  • Peer Reviewed:
    False
  • Source Full Name:
    Proceedings of the 31st Annual SAS Users Group International Conference, March 26-29, 2006, San Francisco, CA
  • Main Document Checksum:
    urn:sha-512:5e09248f6ae63be121b015f7d3abed41b6c5147a06b712971bd0c0e61902211d39dd4195d7e0bf411acdfe24866d221bbcfad65981612086b4fafa924ee26868
  • File Type:
    PDF - 218.73 KB
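
The description lists basic statistics, frequencies, ranges, outliers, and structural information as the per-variable contents of the report. As a rough illustration of that kind of profile, and not the paper's Base SAS code, here is a minimal Python sketch using only the standard library. The function names `profile_column` and `profile_table`, the sample rows, and the 1.5 x IQR outlier rule are all assumptions made for this example; the description does not say how the application flags outliers.

```python
from collections import Counter
from statistics import mean, quantiles


def profile_column(name, values, top_n=3):
    """Summarize one variable: structure, frequencies, range, outliers."""
    # Treat None and empty strings as missing values.
    present = [v for v in values if v is not None and v != ""]
    numeric = bool(present) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    report = {
        "name": name,
        "type": "numeric" if numeric else "character",
        "n": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
        "top_values": Counter(present).most_common(top_n),
    }
    if numeric:
        report["min"], report["max"] = min(present), max(present)
        report["mean"] = mean(present)
        if len(present) >= 4:
            # Assumed outlier rule: beyond 1.5 * IQR from the quartiles.
            q1, _, q3 = quantiles(present, n=4)
            fence = 1.5 * (q3 - q1)
            report["outliers"] = sorted(
                v for v in present if v < q1 - fence or v > q3 + fence
            )
    elif present:
        # Structural detail for character variables: longest stored value.
        report["max_length"] = max(len(str(v)) for v in present)
    return report


def profile_table(rows):
    """Profile every variable in a table given as a list of dicts."""
    names = list(rows[0]) if rows else []
    return [profile_column(n, [r.get(n) for r in rows]) for n in names]


# Hypothetical sample data, invented for this example.
rows = [
    {"age": 34, "site": "A"},
    {"age": 29, "site": "B"},
    {"age": 41, "site": "A"},
    {"age": 38, "site": ""},
    {"age": 36, "site": "A"},
    {"age": 240, "site": "C"},
]

for column_report in profile_table(rows):
    print(column_report)
```

Each dictionary corresponds to one line of a condensed report. With the sample rows, the implausible age of 240 is flagged as an outlier and the blank `site` value is counted as missing.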