Jump to content

Module: Import Pages from CSV file


ryan

Recommended Posts

I could not use the date format of d/m/Y that I wanted no mater how hard I tried. As an example of the strange results I was getting here is the code I used on the event page - the results are in the comments :

echo $config->dateFormat; // Y-m-d H:i:s
echo '<br />';
echo $page->event_start;  // 21/01/2012
echo '<br />';
echo $page->event_end;  // 15/04/2012

$date1 = strtotime($page->event_start);
$date2 = strtotime($page->event_end);

echo $date1;  // empty
echo '<br />';
echo $date2; // empty
echo '<br />';

$diff = $date2 - $date1;

$no_days = floor($diff / (60*60*24)) ;

echo 'duration : ' .$no_days;  // 0

This may have been a result of changing the date field formats too many times.

I deleted all the fields and created them again with default US settings. I then changed all my dates in the CSV to US style m/d/Y re-imported and now it works as expected!

I did not try the d-m-Y format yet (I could not find any country that uses this format in the OSX dates preferences) - it was easier to change to US date settings.

On a side note, I always thought the US date format was odd - the logical way would be small > medium > large (day > month > year). Are they are the only country that starts with a month ?

http://en.wikipedia....rmat_by_country

Thanks,

Michael

Link to comment
Share on other sites

I don't know why the US date format is the way it is. The d/m/Y format does seem more logical. Maybe the US date format is meant as some kind of calendar hierarchy? The date format I prefer the most is Y-m-d, but mainly because it's string-sortable. :)

You don't necessarily need to go changing settings in your spreadsheet if you don't want to. Another way to convert your d/m/Y dates to d-m-Y (dashes rather than slashes) is to load the CSV into a quality text editor and then use a regexp to swap the slashes for dashes. Here's the search and replace I'd use in TextWrangler (BBEdit):

(\d{2})/(\d{2})/(\d{4})	 =>	\1-\2-\3

PHP's strtotime() function treats dates with dashes as European format and dates with slashes as US format. So once you convert those dates in your CSV to use dashes rather than slashes, they should import just fine with the CSV import module.

Link to comment
Share on other sites

Thanks for the regexp tip. I am trying to keep a simple workflow for business users where all data preparation can be done in spreadsheets (excel / numbers / google docs) and then imported to Processwire. I'm a big fan of using spreadsheets as a middle ground or starting point for building applications especially where lots of data is involved.

I've not tested yet, but will the import work with the new multilingual fields? In the past, spreadsheets have been a useful way to translate content with multiple people.

Link to comment
Share on other sites

It's been a while - sorry for that!

I have just tested this module again in a live setup (I was previously testing on a localhost setup). I am still getting this 'problem' where upon installing the module 'Import Pages from CSV' I am simply returned to the module information screen with a highlighted message saying: 'Click to your 'Setup' page to start using the CSV Importer'. I am given no clue as to having to change the 'Process' dropdown for that module setup page in the page tree.

I take it this isn't the designed behaviour?

My localhost testing environment is (if any of this helps) a MAMP installation using PHP 5.3.2 and my live site setup is an Apache installation using using PHP 5.2.17

Link to comment
Share on other sites

It sounds like we're using the exact same localhost testing environment.

This module creates a page: Admin > Setup > Import pages from CSV

Looking at the install function, it won't assign the process if there is already a page there named 'import-pages-csv'. If you've already got a page there with the same name, you'll have to edit it and select 'ImportPagesCSV' for the 'process' field and then hit save. However, assuming you don't have a page there named 'import-pages-csv', then it should take care of it all for you and be ready to use right after you've installed.

Link to comment
Share on other sites

Thanks Ryan,

I did copy over a lot of the content from my localhost - so that could explain a page already being there (assuming that's the case). Does uninstalling the module remove it's setup configuration (removing the page and processes)?

It's not a big enough issue to warrant spending much time on it - especially as I now know how to get around it.

Link to comment
Share on other sites

Uninstalling should remove the page and process. Also there's no harm in manually removing the page either. But if you just want to add the CSV import process, then I would just edit the existing page that's already there and select 'ImportPagesCSV' and save, and that'll be the same as if it had installed it.

Link to comment
Share on other sites

Thanks a lot for making this module, ryan. I've already used to to import over 400 pages... huge time saver.

The one suggestion I have was touched on by one or two people earlier in the thread: the importer doesn't work when the page is missing a 'title' field, presumably because it's used to generate the page name. It's pretty easy to get around by making a dummy title field, populating it with a CSV column, and then deleting it after importation, but it would be really great if during Step 2 we could specifiy a column from the CSV to use for generating a page name/slug.

Link to comment
Share on other sites

  • 4 weeks later...

Idea of ​​such exports is great! And thanks to Ryan for this!

As for me: I'm trying and trying! What is the problem, doc?

What I'm doing wrong or is this a known problem?

If i try to import returns: «Unable to import page because it has no required 'title' field or it is blank.»

My file structure is as follows:

"title"

"Yakimanka"

"Yaroslavl"

"Yasenevo"

What could it be?

Link to comment
Share on other sites

Thanks AlexV! This module was written before multi-language support in PW, and hasn't yet been updated to support multi language title/text/textarea fields (though does support multi language alternate fields). However, even if it had been updated, you would still need a title specified for the page in the 'default' language (since it is a required field). It doesn't care what language the 'default' language is, whether English, Russian, etc. But when you are importing with this module, it is assuming that default language for any multi-language fields. It sounds like in your case, you may want to have Russian as your default language, and English as an add-on language. That way you don't have to translate to English before importing.

Link to comment
Share on other sites

Most likely you got a page title that is empty, thus the error. Every page in PW needs at least the "title" field which is mandatory. You can't save a page without giving it a title. Is there any entries with no title in your csv?

Link to comment
Share on other sites

  • 3 weeks later...

Ryan,

Many thanks for this module, I just used it to add 436 users.

There were a small handful of things that weren't initially clear, so I detailed them below for anyone else trying to import users.

If you plan to import passwords, you need to open the module and add FieldTypePassword to $fieldtypes

protected $fieldtypes = array(
'FieldtypePageTitle',
'FieldtypeText',
'FieldtypeTextarea',
'FieldtypeInteger',
'FieldtypeFloat',
'FieldtypeEmail',
'FieldtypeURL',
'FieldtypeCheckbox',
'FieldtypeFile',
'FieldtypePassword', // add this line
);

Since users are pages and all pages require a title, your CSV will need to have a title column.

In my case, I duplicated all the usernames into that column — so name and title are the same.

In order for title to show as a connection option during your import, you need to add the title field to the user template file.

To do this, go to: Setup > Templates (open the filters area at the top, and choose "show system templates". Select the user template and add the title field.

One other thing to note, be sure to have a roles column in your CSV with roles for each user.

I forgot that during my first test import and all the users were set to guest.

You should be all set to import your users.

:)

  • Like 8
Link to comment
Share on other sites

  • 5 weeks later...

Quick question -- is it possible to automatically import a CSV file from a specified URL, using this module's functions and Lazy Cron? I couldn't figure out how on my cursory glance over the module's code. I'm trying to see if this would be suitable for updating an online inventory, or if I'd be better off rolling my own solution. Thanks!

Link to comment
Share on other sites

The module isn't intended for non-interactive use. However, the PW API certainly is (and LazyCron), and pretty clean and simple for doing stuff like this.

Link to comment
Share on other sites

For something like that, I think you'll be better off just importing from the API. Probably 70% of the time that I need to do any kind of importing of anything, I just use the API because it usually makes a quick and easy job of it, without having to consider any limitations. But if you want to get image/file descriptions in with the CSV import module, then I would probably create a temporary field to store them in, for import purposes. Then go back and paste a little code in a template to quickly convert them over:

foreach($mypages as $p) {
   $p->of(false); 
   $p->image->description = $p->temp; 
   $p->save();
}

Once you've done that, you could then go and delete that 'temp' field from the template.

  • Like 3
Link to comment
Share on other sites

  • 4 weeks later...

Question: does this import module work with the repeater module? How might the CSV look like if I wanted to import multiple fields which were part of a repeater?

Thanks in advance!

Link to comment
Share on other sites

Welcome to the forums Barry. I don't think the CSV importer could be used with the repeater field type. Too many components to put together in making that one work through a simple CSV importer. I think you'd be better off using the API on that one. See here for repeater API usage (bottom of page).

Link to comment
Share on other sites

  • 6 months later...

Is it possible yet to import a set of Pages to represent categories, then import a second set of entries including page references to those categories? I'm looking to import a large EE site with a lot of related entries. I imagine I'd import the category pages with their legacy identifier in a simple text field, and then have the importer match another field in the second import with that of the first (or something similar).

Thanks!

Link to comment
Share on other sites

@bcartier: The ImportPagesCSV-module can't do this as is. But I tried making a tiny addition to make it support FieldtypePage (those used to make page references) and it worked amazingly well. The only change needed was to add 'FieldtypePage' to $fieldtypes array (just before init() function if you take a look at the module file), like this:


       protected $fieldtypes = array(
               'FieldtypePageTitle',
               'FieldtypeText',
               'FieldtypeTextarea',
               'FieldtypeInteger',
               'FieldtypeFloat',
               'FieldtypeEmail',
               'FieldtypeURL',
               'FieldtypeCheckbox',
               'FieldtypeFile',
               'FieldtypePage', // add this line
               );

After that addition it's possible to choose a Page field when connecting the fields from the CSV to the ones in the chosen template. I had pre-populated categories at the target site and used their id's in the CSV file to reference those categories. Multiple categories worked like a charm as well, just use a pipe in between id's (123|456|789).

Moreover, if you've got only one category per entry to reference, then you don't even need the id's of the categories - you can use paths as well. Here's a little example:

cat.csv:

title
one
two
three
four

entries.csv:

title,categories
a,/cats/four/
b,/cats/three/
c,/cats/one/
d,/cats/two/

Import cat.csv using a template for categories with (at least) title field, under a page at /cats/. Then import entries.csv using a template for entries, having a title field and a page field. This should leave you with entries that are connected to categories. I hope this gets you somewhere. :)

@ryan: Looks like page references could be supported very easily. I just used this successfully to import ~3500 pages with category references from an old site to a new PW one. But maybe there's still something else to be done before they're fully supported?

  • Like 14
Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...