Import Filters  Scripts  Thesauri

Total Records: 25 Last Updated
Name Filename Last Update
Clean Derwent Assignees
Notes: Change Derwent codes into their most common assignee name unless they are a) -N or -R codes or b) -I codes in a record with a non -I code.
Requirements: VPv7.1 minimum; Derwent WPI data with "Patent Assignee Codes" and "Patent Assignees (long)" fields
Close All Views
Notes: Closes all active views in your dataset - only the Summary sheet will remain.
Color Cross-Correlation Matrix
Notes: Color code cross-correlation matrix based on whether there are shared records or not.
Requirements: A cross-correlation matrix is open.
Usage: Just because cells in a cross-correlation matrix have a positve number does not mean that the terms actually appear together on any records. This script will color in all the cells with actual records.
Combine All Open Datasets
Compound Field Filler
Notes: Look for empty country items in an assignee/inventor compound field. Fill with value from neighboring, matching rows.
Concatenate Fields
Notes: Concatenate two fields in a record into a new field. The second field can have multiple values per record.
Copy Record Classification
Notes: Exports Record Classification Knowledge Bases then Imports them to another open dataset. Runs a list comparison between the datasets on a matching ID field (e.g. Accession Number) to assign records in Target database to same categories as in the Source.
Count Matrix Rows with Multiple Column Values
Notes: Given a matrix, count the number of row items that co-occur with more than one column item. Results returned in a popup.
Requirements: A co-occurrence matrix is open with more than one column
Usage: Using cleaned lists, quickly count the number of authors that appear with multiple organizations or patents with multiple assignees.
Count-Frequency Plots
Notes: Create a Count-Frequency plot for each field in the dataset. Attempts to fit a 2nd order polynomial to the log of the distribution.
Requirements: VPv14 Dataset is open
Usage: Intended as a seed for bibliometric exploration of distributions in NLP, keyword, classification, and other fields. see Zhiqiang Bi, Christos Faloutsos and Flip Korn, "The DGX Distribution for Mining Massive, Skewed Data", KDD 2001, San Francisco, CA, August 2001
Create All Lists
Notes: Opens a list of every field in the dataset.
Create Full Coverage Group
Notes: Creates a group that includes as many records as possible from as few terms as possible.
Requirements: VPv10 Will prompt for a field if list is not already open.
Usage: User chooses to add terms to the group from the top (most frequent) or bottom (least frequent). Script then does a second pass trying to remove terms but keep full coverage. Script can attempt to make multiple, non-overlapping groups of terms.
Create Record Table
Notes: Creates a table view of a record from user-specified fields.
Requirements: Version 8.0 Minimum
Export List Terms and Groups to Excel
Notes: Send a List with Groups to Excel as a two column list of Terms & Groups
Requirements: A list with groups exists
Export Matrix to Text
Notes: Export an open matrix matrix to tab-delimited, Unicode text file
Requirements: Matrix is open.
Export Title List
Notes: Export to MS Word a formatted list of titles in each group from the selected field.
Requirements: A title field exists and groups exist in the selected descriptor field.
Usage: When there are too many records to read in a fancy printed list, export just the titles. This is specifically intended for exporting factor groups since the titles are sorted by the number of grouped terms they include. If you allow titles to be exported to multiple groups, titles will be prefaced with an asterisk after they have appeared in another group. You also have the option to list titles not affiliated with any groups at the end of the document if you want the document to include every title in the dataset.
Export To Gephi
Notes: VP to Gephi.pdf file included with step by step guide for use, will appear in the Macros\Resource folder.
Requirements: VantagePoint version 14. Latest version of Gephi is installed
Field to Classification
Notes: Create a Record Classification from a field of terms.
Make Earliest Date
Notes: Go through a list of dates. Put the first one for each record into a new field.
Requirements: Dates are expected to be in YYYY-MM-DD or YYYYMMDD format. A field with years or other non-date information will cause errors.
Usage: Use to create a field with, for example, the earliest priority date from a patent family record.
Matrix Column Cross Product
Notes: Export to Excel a cross product of row items by column
Requirements: A matrix must be open.
Usage: Use with a cross-correlation matrix to evaluate the total "connectedness" of each column, for example, in a cross-correlation of organizations by subject terms Note: Script is limited to Excel's maximum of 256 columns.
Matrix Column Sum
Notes: Export to Excel a count of number of row items per column
Requirements: A matrix must be open.
Usage: From a matrix of classifications (rows) by organizations (columns), find the number of unique and total classifications applied to each organizations' records. Note: Script is limited to Excel's maximum of 256 columns.
Pivot Tool
Requirements: VantagePoint v16
Plot Fields
Notes: Create a Rank by # of records plot for each field in the dataset. Estimate power law coefficients.
Requirements: Requires VantagePoint v15.1. Dataset is open
Usage: Makes a series of small plots in a browser sheet, displayed in a sortable table.
Profile Table
Rebuild Dataset
Notes: Export all fields to a generic XML format then re-import everything. Used to reset all raw records and fields in datasets that may have corrupted fields.
Requirements: A dataset is open.
Terms By Year
Notes: Create new field with groups showing which terms appeared in which year (first, last, or all). Offers option to export results to an Excel table.
Requirements: Requires VPv15.1 There is a cleaned year field with at least two items.
Usage: This handy script groups terms by the year they appeared. Grouping by first year is a good way to find who or what only appeared in the dataset recently and what has been around for a while. Grouping by last year can show you which terms have depreciated and are no longer in use. In addition, if you export the Excel report, you can get a total of how many grouped terms appeared in each year. This can be used as an indicator of the maturity of the set (see the "maturity" section of the Analysis Guide).